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XENOBIOTIC DETOXIFICATION GENE FROM PLANTS 

This application claims priority to U.S. 
60/101,814, filed September 25, 1998, the entirety of 
which is incorporated by reference herein. 



5 Pursuant to 35 U.S.C. §202 (c), it is 

acknowledged that the U.S. Government has certain rights 
in the invention described herein, which was made in part 
with funds from the National Science Foundation, Grant 
No. IBN-9416016. 

10 

FIELD OF THE INVENTION 

This invention relates to the field of stress 
resistance in plants. In particular, the invention 
provides a novel gene from plants, which encodes an MDR- 
15 like ABC transporter, involved in detoxification of 
certain xenobiotics to protect plants from their 
detrimental effects. 



BACKGROUND OF THE INVENTION 

20 Several publications are referenced in this 

application to describe the state of the art to which the 
invention pertains. Each of these publications is 
incorporated by reference herein. 

Environmental stress is one of the most 

25 important limitations on plant productivity, growth and 
survival. An ever-increasing source of environmental 
stress to plants is the stress caused by environmental 
pollutants in the soil, water and atmosphere. Such 
pollutants include herbicides, pesticides and related 

30 agronomic products, as well as organic and inorganic 

waste material from industry and other sources. Other 
toxic agents that threaten the survival of plants include 
various toxins produced by ephiphytic or soilborne 
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microorganisms, such as fungi and bacteria. 

In order to survive in toxic environments, 
plants must have mechanisms to detoxify xenobiotics, 
heavy metals and other toxic compounds. This generally 
5 involves modification of the toxic compound and 

subsequent excretion into the vacuole or apoplastic 
space. Recently, certain ATP-binding cassette (ABC) 
transporters have been identified in plants, which appear 
to be involved in the detoxification process. 

10 The ABC transporter family is very large, with 

representatives existing in many different classes of 
organisms. Two well studied groups of ABC transporters, 
encoded by mdr and mrp genes, respectively, are 
associated with the multi-drug resistance phenomenon 

15 observed in mammalian tumor cells. The mdr genes encode 
a family of P-glycoproteins that mediate the energy- 
dependent efflux of certain lipophilic drugs from cells. 
The mrp genes encode a family of transporters that 
mediate the extrusion of a variety of organic compounds 

20 after their conjugation with glutathione. YCF1 , the 
yeast homolog of mrp, encodes a protein capable of 
glutathione-mediated detoxification of heavy metals. 

Homologs of mrp and mdr genes have been 
identified in plant species. In Arabidopsis thaliana, 

25 the glutathione-conjugate transporter encoded by the mrp 
homolog is located in the vacuolar membrane and is 
responsible for sequestration of xenobiotics in the 
central vacuole (Tommasini et al . , FEBS Lett. 411 : 206- 
210, 1997; Li et al . , Plant Physiol. 107: 1257-1268, 

30 1995) . An mdr-like gene (atpgpl) has also been 

identified in A. thaliana, which encodes a putative P- 
glycoprotein homolog. The atpgpl gene was found to share 
significant sequence homology and structural organization 
with human mdr genes, and was expressed with particular 
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abundance in inflorescence axes (Dudler & Hertig, J. 
Biol, Chem. 267 : 5882-5888, 1992). Other MDR homologs 
have been found in potato (Wang et al., Plant Mol. Biol. 
31: 683, 1996) and barley (Davies et al., Gene 199: 195, 
5 1997). 

The aforementioned mrp and mdr plant homologs 
were identified as a result of an effort to understand 
the molecular basis for development in plants of cross- 
resistance to herbicides of unrelated classes. However, 

10 these transporters are likely to serve the more general 
role in plants of sequestering, secreting, or otherwise 
detoxifying various organic and inorganic xenobiotics. 
Accordingly, it will constitute an advance in the art of 
plant genetic engineering of stress tolerance to identify 

15 and characterize other members of this class of 
transporters in plants. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, a new 
20 plant mdr homolog has been identified. Unlike the 

previously identified plant mdr homologs, this new gene 
is inducible by a class of compounds known to inhibit 
chloride ion channels. 

According to one aspect of the invention, a 
25 nucleic acid isolated from a plant is provided, which 

encodes a p-glycoprotein that is inducible by exposure of 
the plant to NPPB. The isolated nucleic acid is 
preferentially expressed in plant roots upon exposure of 
the plant to NPPB. In a preferred embodiment, the plant 
30 from which the nucleic acid is isolated is selected from 
the group consisting of Brassica napus and Arabidopsis 
thaliana and is 3850-4150 nucleotides in length. In a 
more preferred embodiment, the nucleic acid has the 
restriction sites shown in Figure 4 for at least three 
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restriction enzymes. In particularly preferred 
embodiments, the nucleic acid molecule encodes a 
polypeptide having SEQ ID NO: 2. In an exemplary 
embodiment, the nucleic acid is a cDNA comprising the 
5 coding region of SEQ ID NO:l or SEQ ID NO: 10, 

According to another aspect of the invention is 
an expression cassette that comprises a plPAC gene 
operably linked to a promoter, and in a more preferred 
embodiment the plPAC gene is from Arabidopsis. In 

10 preferred embodiments, the expression cassette comprises 
the cauliflower mosaic virus 35S promoter, and part of 
all of SEQ ID NO:l or SEQ ID NO: 10. Further included in 
this aspect is a vector comprising the expression 
cassette and a method for producing transgenic plants 

15 with the expression cassette and vector. 

Another aspect of the invention are transgenic 
cells and plants containing the nucleic acids of the 
invention. In one preferred embodiment, the nucleic 
acids are be in the aforementioned expression cassette. 

20 Furhter included in this aspect are reporductive units 
from the transgenic plant. 

According to another aspect of the invention, 
an isolated nucleic acid molecule is provided, which has 
a sequence selected from the group consisting of: a) SEQ 

25 ID NO:l and SEQ ID NO: 10; b) a nucleic acid sequence 
that is at least about 60% homologous to the coding 
regions of SEQ ID N0:1 or SEQ ID NO: 10; c) a sequence 
hybridizing with SEQ ID NO:l or SEQ ID NO: 10 at moderate 
stringency; d) a sequence encoding part or all of a 

30 polypeptide having SEQ ID NO: 2; e) a sequence encoding an 
amino acid sequence that is at least about 70% identical 
to SEQ ID NO: 2; f) a sequence encoding an amino acid 
sequence that is at least about 80% similar to SEQ ID 
NO: 2; g) a sequence encoding an amino acid sequence that 
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is at least about 40% similar to residues 1-76, 613-669 
or 1144-1161 of SEQ ID N0:2; and h) a sequence 
hybridizing at moderate stringency to a sequence encoding 
residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. A 
5 polypeptide produced by expression of the above listed 
sequences is also provided. 

According to another aspect of the invention, 
an isolated plant p-glycoprotein, which is inducible upon 
exposure of the plant to NPPB, is provided. The 

10 polypeptide preferably confers upon a cell in which it is 
found resistance to Rhodamine 6G. The polypeptide is 
preferentially produced in roots upon the exposure to the 
NPPB . The polypeptide is preferrably from Brassica napus 
or Arabidopsis thaliana. In most preferred embodiments, 

15 the polypeptide has a sequence that is a) an amino acid 
sequence that is at least 80% similar to SEQ ID NO: 2; b) 
an amino acid sequence that is at least 70% identical to 
SEQ ID NO: 2; c) an amino acid sequence that is at least 
40% similar to residues 1-76, 613-669 or 1144-1161 of SEQ 

2 0 ID NO: 2; and d) an amino acid sequence encoded by a 

nucleic acid sequence hybridizing at moderate stringency 
to a amino acid sequence encoding residues 1-76, 613-669 
or 1144-1161 of SEQ ID NO:2. 

According to other aspects of the invention, 

25 antibodies immunologically specific for the polypeptides 
of the invention are provided, that immunologically 
specific to any of the polypeptides, of polypeptide 
encoded by the nucleic acids of the invention. In a 
preferred embodiment, the antibody is immunospecif ic to 

30 residues 1-76, 613-669 or 1144-1161 of SEQ ID N0:2. 

According to another aspect of the invention, a 
plant p-glycoprotein gene promoter, which is inducible by 
NPPB, is also provided. In a preferred embodiment, the 
promoter is part or all of residues 1-3429 of SEQ ID 
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NO:10. 

According to another aspect of the invention, 
plants that have reduces levels of plPAC protein are 
provided. In a preferred embodiment , these plants have 
5 mutations in the plPAC gene, and in a particularly 

preferred embodiment, the plPAC gene is mutated due to 
the insertion of a T-DNA. Also provided with this aspect, 
is a method for selecting plants with mutations in plPAC 
using SEQ ID NOS: 11-14 as PCR primers. 
10 These and other features and advantages of the 

present invention will be described in greater detail in 
the description and examples set forth below. 

BRIEF DESCRIPTION OF THE DRAWINGS 
15 Figure 1. Amino acid sequence lineup of ATPAC 

deduced amino acid sequence and the amino acid sequences 
of related mammalian and plant genes. The lineup shows 
the ATPAC deduced amino acid sequence (SEQ ID NO: 2) 
compared with (1) hmdrl (SEQ ID NO:3); (2) mmdrl (SEQ ID 
20 NO: 4); (3) hmdr3 (SEQ ID NO: 5); (4) mmdr2 (SEQ ID NO: 6); 
(5) atpgpl (SEQ ID NO:7); and (6) atpgp2 (SEQ ID NO:8). 
A consensus sequence (SEQ ID NO: 9) is also shown. 

Figure 2. Graph depicting the effect of 
rhodamine 6G on the growth rate of cells transformed with 
25 and expressing ATPAC as compared with control cells not 
containing ATPAC. 

Figure 3. Restriction map of genomic clone of 
ATPAC , SEQ ID NO: 10. 

Figure 4. Restriction map of cDNA clone of 
30 ATPAC, SEQ ID NO:l. 



DETAILED DESCRIPTION OF the INVENTION 
I • Definitions 

Various terms relating to the biological 
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with default parameters, may be used to determine the 
level of identity and similarity between nucleic acid 
sequences and amino acid sequences. 

The term "substantially the same" refers to 
5 nucleic acid or amino acid sequences having sequence 

variation that do not materially affect the nature of the 
protein (i.e. the structure, thermostability 
characteristics and/or biological activity of the 
protein) . With particular reference to nucleic acid 

10 sequences, the term "substantially the same" is intended 
to refer to the coding region and to conserved sequences 
governing expression, and refers primarily to degenerate 
codons encoding the same amino acid, or alternate codons 
encoding conservative substitute amino acids in the 

15 encoded polypeptide. With reference to amino acid 
sequences, the term "substantially the same" refers 
generally to conservative substitutions and/or variations 
in regions of the polypeptide not involved in 
determination of structure or function. 

20 The terms "percent identical" and "percent 

similar" are also used herein in comparisons among amino 
acid and nucleic acid sequences. When referring to amino 
acid sequences, "percent identical" refers to the percent 
of the amino acids of the subject amino acid sequence 

25 that have been matched to identical amino acids in the 
compared amino acid sequence by a sequence analysis 
program. "Percent similar" refers to the percent of the 
amino acids of the subject amino acid sequence that have 
been matched to identical or conserved amino acids. 

30 Conserved amino acids are those which differ in structure 
but are similar in physical properties such that the 
exchange of one for another would not appreciably change 
the tertiary structure of the resulting protein. 
Conservative substitutions are defined in Taylor (1986, 
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J. Theor. Biol- 119:205). When referring to nucleic acid 
molecules, "percent identical" refers to the percent of 
the nucleotides of the subject nucleic acid sequence that 
have been matched to identical nucleotides by a sequence 
5 analysis program. 

With respect to protein, the term "isolated 
protein" or "isolated and purified protein" is sometimes 
used herein. This term refers primarily to a protein 
produced by expression of an isolated nucleic acid 

10 molecule of the invention. Alternatively, this term may 
refer to a protein which has been sufficiently separated 
from other proteins with which it would naturally be 
associated, so as to exist in "substantially pure" form. 
The term "substantially pure" refers to a 

15 preparation comprising at least 50-60% by weight the 
compound of interest (e.g., nucleic acid, 
oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 75% by weight, and most 
preferably 90-99% by weight, the compound of interest. 

20 Purity is measured by methods appropriate for the 

compound of interest (e.g. chromatographic methods, 
agarose or polyacrylamide gel electrophoresis, HPLC 
analysis, and the like) . 

With respect to antibodies of the invention, 

25 the term "immunologically specific" refers to antibodies 
that bind to one or more epitopes of a protein of 
interest, but which do not substantially recognize and 
bind other molecules in a sample containing a mixed 
population of antigenic biological molecules. 

30 With respect to oligonucleotides, the term 

"specifically hybridizing" refers to the association 
between two single-stranded nucleotide molecules of 
sufficiently complementary sequence to permit such 
hybridization under pre-determined conditions generally 
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used in the art (sometimes termed "substantially 
complementary"). In particular, the term refers to 
hybridization of an oligonucleotide with a substantially 
complementary sequence contained within a single-stranded 
5 DNA or RNA molecule of the invention, to the substantial 
exclusion of hybridization of the oligonucleotide with 
single-stranded nucleic acids of non-complementary 
sequence. 

The term * expression cassette" , as used herein, 

10 comprises 5 f and 3 1 regulatory regions operably linked to 
a coding sequence. The coding sequence may be in the 
sense or antisense orientation with respect to the 5' 
regulatory region. 

The term "promoter region" refers to the 5 1 

15 regulatory regions of a gene. 

The term * reporter gene" refers to genetic 
sequences which may be operably linked to a promoter 
region forming a transgene, such that expression of the 
reporter gene coding region is regulated by the promoter 

20 and expression of the transgene is readily assayed. 

The term "selectable marker gene" refers to a 
gene product that when expressed confers a selectable 
phenotype, such as antibiotic resistance, on a 
transformed cell or plant. 

25 The term "operably linked" means that the 

regulatory sequences necessary for expression of the 
coding sequence are placed in the DNA molecule in the 
appropriate positions relative to the coding sequence so 
as to effect expression of the coding sequence. This 

30 same definition is sometimes applied to the arrangement 
of coding sequences and transcription control elements 
(e.g. promoters, enhancers, and termination elements) in 
an expression vector. 

The term " DNA construct" refers to genetic 
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sequence used to transform plants and generate progeny 
transgenic plants. These constructs may be administered 
to plants in a viral or plasmid vector. Other methods of 
delivery such as Agrobacterium T-DNA mediated 
5 transformation and transformation using the biolistic 
process are also contemplated to be within the scope of 
the present invention. The transforming DNA may be 
prepared according to standard protocols such as those 
set forth in "Current Protocols in Molecular Biology", 
10 eds. Frederick M. Ausubel et al . , John Wiley & Sons, 
1995. 

The term *xenobiotic" refers to foreign 
chemicals or agents not produced or naturally found in 
the organism. The term is commonly used in reference to 
15 toxic or otherwise detrimental foreign chemicals, such as 
organic pollutants or heavy metals. 

II. Description of plPAC 

and its Encoded Polveptide 

20 

In accordance with the present invention, a 
nucleic acid encoding a novel ATP-binding-cassette (ABC) 
transporter has been isolated and cloned from plants. 
The nucleic acid is referred to herein as plPAC. 

25 A cDNA clone of the plPAC from Arabidopsis 

thaliana, an exemplary plPAC of the invention, is 
described in detail herein and its nucleotide sequence is 
set forth in Example 1 as SEQ ID NO:l. This nucleic acid 
molecule is referred to as * ATP AC . It is 36% identical 

30 and 51% similar to human mdrl across the entire sequence. 
It is 51% identical to the atpgpl gene reported by Dudler 
& Hertig (1997, supra) and 50% identical to atpgp2, a 
close homolog of atpgpl, published in the Genbank 
database. ATP AC protein is 65% similar to atpgpl and 

35 atpgp2 proteins. 
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A partial clone of a plPAC of the invention was 
originally isolated from Brassica napus via differential 
expression screening of plants grown in the presence or 
absence of the chloride channel blocker, 5-nitro-2- (3- 
5 phenylpropylamino) benzoic acid (NPPB) . A 0.5 kb gene 
fragment was identified, which had been up-regulated in 
response to NPPB treatment. This cDNA fragment was used 
to screen an Arabidopsis cDNA library, from which the 
complete ATPAC clone was isolated. The isolation and 

10 characterization of ATPAC is described in Example 1. 

A genomic clone of ATPAC (SEQ ID NO: 10) has 
also been isolated from a bacterial artificial chromosome 
(BAC) library of the Arabidopsis genome (BAG clone IGF 
F3J22, obtained from the Arabidopsis stock center, Ohio 

15 State University) . A 7 kb fragment containing part of 
ATPAC and additional 5' regulatory sequences was 
subcloned into a plasmid vector (pBluescript ) . A 
restriction map of ATPAC is found in Fig. 3. The 
corresponding cDNA clone of ATPAC is found in SEQ ID NO:l 

20 and its restriction map is Fig. 4. 

Among the unique features of this nucleic acid 
molecule as compared with other mdr-like genes from 
plants are its inducibility by certain compounds, 
including NPPB and herbicides, and its preferential 

25 expression in roots. The promoter regulatory region of 
ATPAC comprises residues 1-3429 of SEQ ID NO: 10. 

Although the ATPAC cDNA clone from Arabidopsis 
thaliana is described and exemplified herein, this 
invention is intended to encompass nucleic acid sequences 

30 and proteins from other plant species that are 

sufficiently similar to be used instead of ATPAC nucleic 
acid and proteins for the purposes described below. 
These include, but are not limited to, allelic variants 
and natural mutants of SEQ ID NO:l, which are likely to 
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be found in different species of plants or varieties of 
Arabidopsis. 

Because such variants are expected to possess 
certain differences in nucleotide and amino acid 
5 sequence, this invention provides an isolated plPAC 
nucleic acid molecule having at least about 60% 
(preferably 70% and more preferably over 80%) sequence 
homology in the coding regions with the nucleotide 
sequence set forth as SEQ ID NO:l or SEQ ID NO: 10 (and, 

10 most preferably, specifically comprising the coding 

region of SEQ ID NO:l). Also provided are nucleic acids 
that encode a polypeptide that is at least about 40% 
(preferably 50% and most preferably 60%) similar to 
residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. Also 

15 provided are nucleic acids that hybridize to the nucleic 
acids of SEQ ID NO:l, SEQ ID NO: 10, or nucleic acids 
encoding the regions of residues 1-76, 613-669 or 1144- 
1161 of SEQ ID NO: 2, preferably under moderate stringency 
(more preferably, high stringency, and most preferably, 

20 very high stringency) . 

In other preferred embodiments, the nucleic 
acids have a restriction digest map that is identical for 
at least 3 enzymes (more preferably 6 enzymes and most 
preferably 9 enzymes) to the maps shown in Figs. 3 or 4. 

25 In another preferred embodiment, the nucleic acids have a 
restriction digest map identical to those shown in Fig. 3 
for enzymes Xhol, Xcml and Spel (preferably additionally 
Sad, Pad and Bsal, and most preferably additionally 
Acil, BanI and SnaBI) . In another preferred embodiment, 

30 the nucleic acids have a restriction digest map identical 
to those shown in Fig. 4 for enzymes Xbal, TatI and Neil 
(preferably additionally Dral, BsmI and Bell, and most 
preferably additionally AccI, Bsgl and Tlil) . The 
nucleic acids of the invention are at least 20 nucleic 
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acids in length (preferably at least 50 nucleic acids and 
most preferably at least 100 nucleic acids) . 

In accordance with the invention, novel plPAC 
genes from two plant species, Brassica napus and 
5 Arabidopsis thaliana, are presented. This constitutes 
the first description of this unique p-glycoprotein in 
plants- Indeed, the closest known protein sequence, also, 
from Arabidopsis, is only 65% identical suggesting that 
the ATPAC gene is novel and is expected to have novel 

10 properties* The isolation of two plPAC genes from 

different species enables the isolation of further plPAC 
genes from other plant species. Isolated nucleic acids 
that are plPAC genes from any plant species are 
considered part of the instant invention. In particular, 

15 the nucleic acids of other plPAC genes can be isolated 
using sequences of ATPAC that distinguish plPAC genes 
from other plant mdr genes according to methods that are 
well known to those in the art of gene isolation. In 
particular, sequences that encode residues 1-76, 613-669 

20 and 1144-1161 of SEQ ID NO:2 can be used. In a preferred 
embodiment, the plPAC gene is from any higher plant 
species (more preferred from a dicot species, and most 
preferred from a species in Brassicaceae (or 
Cruciferae) ) . 

25 This invention also provides isolated 

polypeptide products of the open reading frames of SEQ ID 
NO:l or SEQ ID NO: 10, having at least about 70% 
(preferably 80% and most preferably 90%) sequence 
identity, or at least about 80% similarity (preferably 

30 90% and more preferably 95%) with the amino acid sequence 
of SEQ ID NO: 2. In another embodiment, the polypeptides 
of the invention are at least about 40% identical 
(preferably 50%, and most preferably 60%) to the regions 
of residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. 
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Because of the natural sequence variation likely to exist 
among plPAC genes, one skilled in the art would expect to 
find up to about 30-40% nucleotide sequence variation, 
while still maintaining the unique properties of the 
5 plPAC gene and encoded polypeptide of the present 

invention. Such an expectation is due in part to the 
degeneracy of the genetic code, as well as to the known 
evolutionary success of conservative amino acid sequence 
variations, which do not appreciably alter the nature of 

10 the encoded protein. Accordingly, such variants are 

considered substantially the same as one another and are 
included within the scope of the present invention. 

Also provided are transgenic plants transformed 
with part or all of the nucleic acids of the invention. 

15 Transgenic plants that over-express a plPAC coding 
sequence are one embodiment of this aspect of the 
invention. Example 3 provides for one prototype of such 
a plant. In a preferred embodiment, the ATPAC gene is 
used, and in a most preferred embodiment SEQ ID NO:l or 

20 SEQ ID NO: 10 is used. The plPAC gene may be placed under 
a powerful constitutive promoter, such as the Cauliflower 
Mosaic Virus (CaMV) 35S promoter or the figwort mosaic 
virus 35S promoter. In a preferred embodiment, the 
35SCaMV promoter is used. Transgenic plants expressing 

25 the plPAC gene under an inducible promoter (either its 
own promoter or a heterologous promoter) are also 
contemplated to be within the scope of the present 
invention. Inducible plant promoters include the 
tetracycline repressor/operator controlled promoter. In 

30 a preferred embodiment, a native plPAC promoter is used, 
and in a most preferred embodiment, residues 1-3429 of 
SEQ ID NO: 10 is used. Plant species that are 
contemplated for overexpression of a plPAC coding 
sequence include, but are not limited to, soybean. 
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In another embodiment, overexpression of plPAC 
is induced to generate a co-suppression effect. This 
excess expression serves to promote down-regulation of 
both endogenous and exogenous plPAC genes. 
5 In some instances, it may be desirable to down- 

regulate or inhibit expression of endogenous plPAC in 
plants possessing the gene. Accordingly, plPAC nucleic 
acid molecules, or fragments thereof, may also be 
utilized to control the production of pIPAC-encoded P- 

10 glycoproteins. In one embodiment, full-length plPAC 
antisense molecules or antisense oligonucleotides, 
targeted to specific regions of pIPAC-encoded RNA that 
are critical for translation, are used. The use of 
antisense molecules to decrease expression levels of a 

15 pre-determined gene is known in the art. In a preferred 
embodiment, antisense molecules are provided in situ by 
transforming plant cells with a DNA construct which, upon 
transcription, produces the antisense sequences. Such 
constructs can be designed to produce full-length or 

20 partial antisense sequences. One example of antisense 
plPAC transgenic plants is given in Example 3. 

In another embodiment, knock-out plants are 
obtained by screening a T-DNA mutagenized plant 
population for insertions in the plPAC gene (see Krysan 

25 et al., 1996, PNAS 93:8145). One example of this 
embodiment of the invention is found in Example 3. 
Optionally, transgenic plants can be created containing 
mutations in the region encoding the active site of 
plPAC. These last two embodiments are preferred over the 

30 use of anti-sense constructs due to the high homology 
among P-glycoproteins . 

The promoter of ATP AC is also provided in 
accordance with the invention. This promoter has the 
useful properties of root expression and inducability by 
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NPPB. The prototypic example of this aspect of the 
invention is residues 1-3429 of SEQ ID NO: 10. It is 
anticipated that plPAC genes from other plant species 
will likewise exhibit the aforementioned useful 
5 properties. As these promoter regions can easily be 

isolated from the plPAC genes that are provided with the 
invention, all plant plPAC gene promoters are provided 
with the invention. The nucleic acids of the invention 
therefore include a nucleic acid molecule that is at 

10 least about 70% identical (preferably 80% and most 

preferably 90%) to the residues 1-3429 of SEQ ID NO: 10. 
Also provided are nucleic acids that hybridize to the 
nucleic acid residues 1-3429 of SEQ ID NO: 10 preferably 
under moderate stringency (more preferably, high 

15 stringency, and most preferably, very high stringency) . 

The present invention also provides antibodies 
capable of immuno- specif ically binding to polypeptides of 
the invention. Polyclonal or monoclonal antibodies 
directed toward any of the peptides encoded by plPAC may 

20 be prepared according to standard methods. Monoclonal 
antibodies may be prepared according to general methods 
of Kohler and Milstein, following standard protocols. In 
a preferred embodiment, antibodies are prepared, which 
react immuno-specif ically with various epitopes of the 

25 plPAC-encoded polypeptides. In a preferred embodiment, 
the antibodies are immunologically specific to the 
polypeptide of residues 1-76, 613-669 or 1144-1161 of SEQ 
ID NO: 2. 

The following description sets forth the 
30 general procedures involved in practicing the present 
invention. To the extent that specific materials are 
mentioned, it is merely for purposes of illustration and 
is not intended to limit the invention. Unless otherwise 
specified, general cloning procedures, such as those set 
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forth in Sambrook et al., Molecular Cloning , Cold Spring 
Harbor Laboratory (1989) (hereinafter "Sambrook et al.") 
or Ausubel et al. (eds) Current Protocols in Molecular 
Biology , John Wiley & Sons (1997) (hereinafter ^Ausubel 
5 et al ." ) are used. 



III. Preparation of P1PAC Nucleic Acid Molecules, 

encoded Polypeptides, Antibodies Specific for the 
Polypeptides and Transgenic Plants 



10 



1 . Nucleic Acid Molecules 

P1PAC nucleic acid molecules of the invention 
may be prepared by two general methods: (1) they may be 

15 synthesized from appropriate nucleotide triphosphates, or 
(2) they may be isolated from biological sources. Both 
methods utilize protocols well known in the art. 

The availability of nucleotide sequence 
information, such as the cDNA having SEQ ID NO:l, enables 

20 preparation of an isolated nucleic acid molecule of the 
invention by oligonucleotide synthesis. Synthetic 
oligonucleotides may be prepared by the phosphoramadite 
method employed in the Applied Biosystems 38A DNA 
Synthesizer or similar devices. The resultant construct 

25 may be purified according to methods known in the art, 
such as high performance liquid chromatography (HPLC) . 
Long, double-stranded polynucleotides, such as a DNA 
molecule of the present invention, must be synthesized in 
stages, due to the size limitations inherent in current 

30 oligonucleotide synthetic methods. Thus, for example, a 
long double-stranded molecule may be synthesized as 
several smaller segments of appropriate complementarity. 
Complementary segments thus produced may be annealed such 
that each segment possesses appropriate cohesive termini 

35 for attachment of an adjacent segment. Adjacent segments 
may be ligated by annealing cohesive termini in the 
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presence of DNA ligase to construct an entire long 
double-stranded molecule. A synthetic DNA molecule so 
constructed may then be cloned and amplified in an 
appropriate vector. 
5 P1PAC genes also may be isolated from 

appropriate biological sources using methods known in the 
art. In fact, the ATPAC clone was isolated from an 
Arabidopsis cDNA library using a partial clone obtained 
from Brassica napus. In alternative embodiments, genomic 

10 clones of plPAC may be isolated. 

In accordance with the present invention, 
nucleic acids having the appropriate level sequence 
homology with part or all the coding regions of SEQ ID 
NO:l or SEQ ID NO: 10 may be identified by using 

15 hybridization and washing conditions of appropriate 
stringency. For example, hybridizations may be 
performed, according to the method of Sambrook et al., 
using a hybridization solution comprising: 5X SSC, 
5X Denhardt's reagent, 1.0% SDS, 100 ug/ml denatured, 

20 fragmented salmon sperm DNA, 0.05% sodium pyrophosphate 

and up to 50% formamide. .Hybridization is carried out at 
37-42 °C for at least six hours. Following hybridization, 
filters are washed as follows: (1) 5 minutes at room 
temperature in 2X SSC and 1% SDS; (2) 15 minutes at room 

25 temperature in 2X SSC and 0.1% SDS; (3) 30 minutes-1 

hour at 37 °C in 2X SSC and 0.1% SDS; (4) 2 hours at 45- 
55°in 2X SSC and 0.1% SDS, changing the solution every 30 
minutes . 

One common formula for calculating the 
30 stringency conditions required to achieve hybridization 
between nucleic acid molecules of a specified sequence 
homology (Sambrook et al., 1989): 

T m = 81.5°C + 16.6Log [Na+] + 0.41(% G+C) - 0.63 (% formamide) - 600/#bp in duplex 
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As an illustration of the above formula, using [N+] ~ 
[0.368] and 50% formamide, with GC content of 42% and an 
average probe size of 200 bases, the T m is 57°C. The T m 
of a DNA duplex decreases by 1 - 1.5°C with every 1% 
5 decrease in homology. Thus, targets with greater than 
about 75% sequence identity would be observed using a 
hybridization temperature of 42 °C. 

The stringency of the hybridization and wash 
depend primarily on the salt concentration and 

10 temperature of the solutions. In general, to maximize 
the rate of annealing of the probe with its target, the 
hybridization is usually carried out at salt and 
temperature conditions that are 20-25°C below the 
calculated T m of the of the hybrid. Wash conditions 

15 should be as stringent as possible for the degree of 

identity of the probe for the target. In general, wash 
conditions are selected to be approximately 12-20°C below 
the T m of the hybrid. In regards to the nucleic acids of 
the current invention, a moderate stringency 

20 hybridization is defined as hybridization in 6X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 pg/ml denatured 
salmon sperm DNA at 42°C, and wash in 2X SSC and 0.5% SDS 
at 55°C for 15 minutes. A high stringency hybridization 
is defined as hybridization in 6X SSC, 5X Denhardt's 

25 solution, 0.5% SDS and 100 jig/ml denatured salmon sperm 
DNA at 42 °C, and wash in IX SSC and 0.5% SDS at 65°C for 
15 minutes. A very high stringency hybridization is 
defined as hybridization in 6X SSC, 5X Denhardt's 
solution, 0.5% SDS and 100 yig/ml denatured salmon sperm 

30 DNA at 42°C, and wash in 0.1X SSC and 0.5% SDS at 65°C 
for 15 minutes. 

Nucleic acids of the present invention may be 
maintained as DNA in any convenient cloning vector. In a 
preferred embodiment, clones are maintained in plasmid 
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cloning/expression vector, such as pGEM-T (Promega 
Biotech, Madison, WI) or pBluescript (Stratagene, La 
Jolla, CA) , either of which is propagated in a suitable 
E. coli host cell. 
5 PIP AC nucleic acid molecules of the invention 
include cDNA, genomic DNA, RNA, and fragments thereof 
which may be single- or double-stranded* Thus, this 
invention provides oligonucleotides (sense or antisense 
strands of DNA or RNA) having sequences capable of 

10 hybridizing with at least one sequence of a nucleic acid 
molecule of the present invention, such as selected 
segments of SEQ ID NO:l or SEQ ID NO: 10. Such 
oligonucleotides are useful as probes for detecting plPAC 
genes or mRNA in test samples, e.g. by PCR amplification, 

15 mapping of genes or for the positive or negative 

regulation of expression of plPAC genes at or before 
translation of the mRNA into proteins. 

The plPAC promoter is also expected to be 
useful in connection with the present invention, inasmuch 

20 as it is inducible in plants upon exposure to anion 

channel blockers. As mentioned above, seven-kilobase 
fragment of genomic DNA has been isolated, which contains 
part or all of the plPAC promoter from Arabidopsis 
thaliana. This promoter can be used in chimeric gene 

25 constructs to facilitate inducible expression of any 
coding sequence of interest, upon exposure to NPPB or 
similar-acting compounds. 

2 . Proteins 

30 Polypeptides encoded by plPAC nucleic acids of 

• the invention may be prepared in a variety of ways, 
according to known methods. If produced in situ the 
polypeptides may be purified from appropriate sources, 
e.g., plant roots or other plant parts. 
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Alternatively, the availability of nucleic acid 
molecules encoding the polypeptides enables production of 
the proteins using in vitro expression methods known in 
the art. For example, a cDNA or gene may be cloned into 
5 an appropriate in vitro transcription vector, such a 
P SP64 or pSP65 for in vitro transcription, followed by 
cell-free translation in a suitable cell-free translation . 
system, such as wheat germ or rabbit reticulocytes. In 
vitro transcription and translation systems are 
10 commercially available, e.g., from Promega Biotech, 
Madison, Wisconsin or BRL, Rockville, Maryland. 

According to a preferred embodiment, larger 
quantities of plPAC-encoded polypeptide may be produced 
by expression in a suitable procaryotic or eucaryotic 
15 system. For example, part or all of a DNA molecule, such 
as the cDNA having SEQ ID NO:l, may be inserted into a 
' plasmid vector adapted for expression in a bacterial cell 
(such as E. coli) or a yeast cell (such as Saccharomyces 
cerevisiae) , or into a baculovirus vector for expression 
20 in an insect cell. Such vectors comprise the regulatory 
elements necessary for expression of the DNA in the host 
cell, positioned in such a manner as to permit expression 
of the DNA in the host cell. Such regulatory elements 
required for expression include promoter sequences, 
25 transcription initiation sequences and, optionally, 
enhancer sequences. 

The plPAC polypeptide produced by gene 
expression in a recombinant procaryotic or eucyarotic 
system may be purified according to methods known in the 
30 art. In a preferred embodiment, a commercially available 
expression/secretion system can be used, whereby the 
recombinant protein is expressed and thereafter secreted 
from the host cell, to be easily purified from the 
surrounding medium. If expression/secretion vectors are 
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not used, an alternative approach involves purifying the 
recombinant protein by affinity separation, such as by 
immunological interaction with antibodies that bind 
specifically to the recombinant protein. Such methods 
5 are commonly used by skilled practitioners. 

The plPAOencoded polypeptides of the 
invention, prepared by the aforementioned methods, may be . 
analyzed according to standard procedures. 

10 C. Transgenic Plants 

Transgenic plants expressing the plPAC gene can 
be generated using standard plant transformation methods 
known to those skilled in the art. These include, but 
are not limited to, Agrobacterium vectors, PEG treatment 

15 of protoplasts, biolistic DNA delivery, UV laser 

microbeam, gemini virus vectors, calcium phosphate 
treatment of protoplasts, electroporation of isolated 
protoplasts, agitation of cell suspensions with 
microbeads coated with the transforming DNA, direct DNA 

20 uptake, liposome-mediated DNA uptake, and the like. Such 
methods have been published in the art. See, e.g., 
Methods for Plant Molecular Biology (Weissbach & 
Weissbach, eds., 1988); Methods in Plant Molecular 
Biology (Schuler & Zielinski, eds., 1989); Plant 

25 Molecular Biology Manual (Gelvin, Schilperoort, Verma, 
eds., 1993); and Methods in Plant Molecular Biology - A 
Laboratory Manual (Maliga, Klessig, Cashmore, Gruissem & 
Varner, eds., 1994) . 

The method of transformation depends upon the 

30 plant to be transformed. The biolistic DNA delivery 

method is useful for nuclear transformation. In another 
embodiment of the invention, Agrobacterium vectors are 
used to advantage for efficient transformation of plant 
nuclei . 
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In a preferred embodiment, the gene is 
introduced into plant nuclei in Agrobacterium binary 
vectors. Such vectors include, but are not limited to, 
BIN19 (Bevan, 1984, Nucleic Acid Res 12: 8711-8721) and 
5 derivatives thereof, the pBI vector series {Jefferson et 
al., 1987, PNAS 83:8447-51), and binary vectors pGA482 
and pGA492 (An, 1986) and others (for review, see An, 
1995, Methods Mol Biol 44:47-58) . In preferred 
embodiments, the pPZP211 vector (Hajdukiewicz et al,, 

10 1994, PMB 25:989-994) or PCGN7366 (Calgene, CA) are used, 
DNA constructs for transforming a selected plant comprise 
a coding sequence of interest operably linked to 
appropriate 5* (e.g., promoters and translational 
regulatory sequences) and 3 T regulatory sequences (e.g., 

15 terminators) . 

Using an Agrobacterium binary vector system for 
transformation, the plPAC coding region, under control of 
a constitutive or inducible promoter as described above, 
is linked to a nuclear drug resistance marker, such as 

20 kanamycin resistance. Agrobacterium-mediated 

transformation of plant nuclei is accomplished according 
to the following procedure: 

(1) the gene is inserted into the selected 
Agrobacterium binary vector; 

25 (2) transformation is accomplished by co- 

cultivation of plant tissue (e.g., leaf discs) with a 
suspension of recombinant Agrobacterium, followed by 
incubation (e.g., two days) on growth medium in the 
absence of the drug used as the selective medium (see, 

30 e.g., Horsch et al . 1985, Cold Spring Harb Symp Quant 
Biol. 50:433-7) ; 

(3) plant tissue is then transferred onto the 
selective medium to identify transformed tissue; and 

(4) identified transf ormants are regenerated 
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to intact plants. 

It should be recognized that the amount of 
expression, as well as the tissue specificity of 
expression of the plPAC gene in transformed plants can 
5 vary depending on the position of their insertion into 

the nuclear genome. Such position effects are well known 
in the art. For this reason, several nuclear 
transformants should be regenerated and tested for 
expression of the transgene. 

10 

IV. Uses of PlPAC Nucleic Acids, Encoded Proteins 
and Antibodies 



1 • PlPAC Nucleic Acids 

15 PlPAC nucleic acids may be used for a variety 

of purposes in accordance with the present invention. 
The DNA, RNA, or fragments thereof may be used as probes 
to detect the presence of and/or expression of plPAC 
genes. Methods in which plPAC nucleic acids may be 

20 utilized as probes for such assays include, but are not 
limited to: (1) in situ hybridization; (2) Southern 
hybridization (3) northern hybridization; and (4) 
assorted amplification reactions such as polymerase chain 
reactions (PCR) . 

25 The plPAC nucleic acids of the invention may 

also be utilized as probes to identify related genes from 
other plant species. As is well known in the art and 
described above, hybridization stringencies may be 
adjusted to allow hybridization of nucleic acid probes 

30 with complementary sequences of varying degrees of 
homology. Thus, plPAC nucleic acids may be used to 
advantage to identify and characterize other genes of 
varying degrees of relation to the exemplary ATPAC, 
thereby enabling further characterization of this family 
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of genes in plants* Additionally,, they may be used to 
identify genes encoding proteins that interact with the 
P-glycoprotein encoded by plPAC (e.g., by the 
"interaction trap" technique) . 

5 

2 . PlPAC Proteins and Antibodies 

Purified plPAC-encoded P-glycoproteins, or 
fragments thereof, may be used to produce polyclonal or 
monoclonal antibodies which also may serve as sensitive 

10 detection reagents for the presence and accumulation of 
plant P-glycoproteins in cultured plant cells or tissues 
and in intact plants. Recombinant techniques enable 
expression of fusion proteins containing part or all of 
the pJPAC-encoded protein. The full length protein or 

15 fragments of the protein may be used to advantage to 

generate an array of monoclonal or polyclonal antibodies 
specific for various epitopes of the protein, thereby 
providing even greater sensitivity for detection of the 
protein in cells or tissue. 

20 Polyclonal or monoclonal antibodies 

immunologically specific for pIPAOencoded proteins may 
be used in a variety of assays designed to detect and 
quantitate the protein. Such assays include, but are not 
limited to: (1) flow cytometric analysis/ (2) 

25 immunochemical localization in cultured cells or tissues; 
and (3) immunoblot analysis (e.g., dot blot, Western 
blot) of extracts from various cells and tissues. 

Polyclonal or monoclonal antibodies that 
immunospecif ically interact with one or more of the 

30 polypeptides encoded by plPAC can be utilized for 

identifying and purifying such proteins. For example, 
antibodies may be utilized for affinity separation of 
proteins with which they immunospecif ically interact. 
Antibodies may also be used to immunoprecipitate proteins 
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from a sample containing a mixture of proteins and other 
biological molecules. 

3. plPAC Transgenic Plants 

5 Transgenic plants that over- or under- express 

plPAC can be used in a varied of agronomic and research 
applications. From the foregoing discussion, it can be 
seen that plPAC and its homologs, and transgenic plants 
containing them will be useful for improving stress 

10 resistance or tolerance in plants. This provides an 

avenue for developing marginal or toxic soil environments 
for crop production. Both over- and under-expressing 
plPAC transgenic plants have great utility in the 
research of herbicides and other xenobiotic compounds. 

15 As discussed above and in greater detail in 

Example 1, the similarity between plant and mammalian mdr 
genes indicates that their functional aspects will also 
be conserved. Thus, plPAC is expected to play an 
important, role in the exclusion of toxic metabolic or 

20 xenobiotic compounds from cells. The fact that plPAC 
also is inducible and appears to be preferentially 
expressed in roots, where contact with such compounds 
often occurs, makes plPAC particularly desirable for 
genetic engineering of plants to increase their tolerance 

25 to such compounds. Accordingly, plants engineered to 

overexpress the plPAC gene should be resistant to a wide 
range of chemicals, both intentionally applied as 
herbicides or unintentionally as wastes. Examples of the 
kinds of xenobiotics that should be detoxified by the 

30 plPAC of the invention include, but are not limited to, 
hydrophobic (i.e., lipophilic) herbicides and other 
compounds, such as 3 (3, 4-dichlorophenyl) -1, 1, dimethyl 
urea (also known as DCMU or Diuron, available from Sigma 
Chemical Co., St. Louis, MO) or other hydrophobic 
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compounds that disrupt photosynthetic electron transport, 
as well as Metachlor (Ciba Geigy, Basel Switzerland) , 
Taurocholate (Sigma Chemical Co.), Primisulfuron (Ciba 
Geigy) , and IRL-1803. 

As illustrated in Example .2, plant cells that 
over-express a plPAC gene have surprisingly higher growth 
rate with or without the xenobiotic compound Rhodamine 
6G. It is contemplated that plPAC overexpression may be 
a generally useful way to increase plant and plant cell 
culture growth, even without the presence of xenobiotic 
compounds . 

The following specific examples are provided to 
illustrate embodiments of the invention. They are not 
intended to limit the scope of the invention in any way. 



20 



EXAMPLE 1 
Cloning and Analysis of a 
PlPAC From Arabidonsi s thaliana 



The plPAC of the present invention was 
identified by its up-regulation in response to a chloride 

25 ion channel blocker. Brassica napus plants were grown 

either in the presence or absence of 20 uM 5-nitro-2- (3- 
phenylpropyl amino) benzoic acid (NPPB) . After five days, 
the roots of the seedlings were harvested and total RNA 
was extracted separately from the treated and untreated 

30 plants. From the total RNA preparations, poly (A) + RNA 
was isolated and used as the starting material to create 
a cDNA subtraction library, using the CLONTECH PCR- 
SELECT™ cDNA Subtraction Kit and accompanying 
instructions (CLONTECH Laboratories, Inc., Palo Alto, 

35 CA) . 
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Using the subtractive hybridization kit, a gene 
fragment was identified that was up-regulated in response 
to treatment of the plants with NPPB. This fragment (0.5 
kb) was used to screen a cDNA library of Arabidopsis 
5 thaliana, from which a full-length cDNA clone was 

isolated. The nucleotide sequence of this cDNA clone, 
referred to as ATPAC {Arabidopsis thaliana putative anion • 
channel) is set forth below as SEQ ID NO:l. 

The 3.7 6 kb cDNA clone encodes a polypeptide 

10 1,254 amino acids in length. The deduced amino acid 

sequence encoded by SEQ ID N0:1 is shown in Figure 1 as 
*atpac" (SEQ ID NO:2), in a lineup with the following 
sequences: (1) hmdrl (SEQ ID NO:3); (2) mmdrl (SEQ ID 
NO:4); (3) hmdr3 (SEQ ID NO:5); (4) mmdr2 (SEQ ID NO:6); 

15 (5) atpgpl (SEQ ID NO:7); and (6) atpgp2 (SEQ ID NO: 8). 
A consensus sequence (SEQ ID NO: 9) is also shown. 

A search of various sequence databases 
indicates that ATPAC is a new and distinct member of the 
mdr family of ABC transporters. In none of the 

20 databases, including the EST collection, does an exact 
match exist. The ABC transporter family is very large, 
consisting of at least two sub-groups, mrp and homologs 
and mdr and homologs. The only examples of plant mdr- 
like genes are atpgpl and atpgp2 from A, thaliana and two 

25 homologs from potato and barley, respectively. Though 
. the atpgpl and atpgp2 genes are similar to ATPAC, they 
are only 51 and 50% identical, respectively, indicating 
that ATPAC is a distinct gene by comparison. Sequence 
homology with the potato and barley mdr-like genes is 

30 even more divergent. Another difference between the 
agpgpl gene and the ATPAC gene is their respective 
preferential expression in inflorescens and. roots, 
respectively. 
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EXAMPLE 2 

Effect of ATPAC Expression in Bacterial Cells 
5 on Their Ability to Detoxify Rhodamine 6G 

The compound Rhodamine 6G is a well known 
substrate of mammalian p-glycoproteins (Kolaczkowski et 
al., J. Biol. Chem. 271: 31543-31548, 1996). The ability 

10 of a cell to detoxify the compound is indicative of 

activity of p-glycoproteins. A bacterial cell line was 
transformed with an expression vector comprising ATPAC. 
The growth rate of transformed and non-transformed cells 
was then measured, in the presence or absence of 

15 Rhodamine 6G. Results are shown in Figure 2. As can be 
seen, ATP^C-expressing cells grown in the absence of the 
drug had the best growth rate. Moreover, even in the 
presence of the drug, the cells grew more quickly than 
non-transformed cells in the presence or absence of 

20 Rhodamine 6G. These results demonstrate that ATPAC 
encodes a functional and robust p-glycoprotein. 

Example 3 
Transgenic Plants the Overexpress 
25 and Underexpress ATPAC 

Transformation construct. The Agrobacterium 
binary vector pPZP211 (Hajdukiewicz et al., 1994 Plant 
Mol. Biol. 25:989-994) was digested with EcoRI and Smal, 

30 and self-ligated. This molecule was named pPZP211'. The 
Agrobacterium binary vector pCGN7366 (Calgene, CA) was 
digested with Xhol and cloned in Sail-digested pPZP211 f . 
We named this binary vector pPZP-PCGN. The 3.8 kb full- 
length ATPAC cDNA was cloned into the pGH19 vector. 

35 After digestion with Smal (in the multiple cloning site 
upstream) and EcoRI, a 3.1 kb cDNA fragment was cut out. 
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This Smal-EcoRI 3.1 kb fragment was cloned into the 
Smal/EcoRI site of pPZP-pCGN. The rest of ATPAC gene was 
amplified using polymerase chain reaction to have 
translationally fused HA-tag at its 3 1 -terminal . After 
5 ligating EcoRl linkers to the ends of the resulting PCR 
product, the 0,7 kb fragment was cloned into the EcoRI 
site of the Smal-EcoRI 3.1 kb ATPAC fragment in pPZP- 
pCGN. The final construct was named pATPAC-OE. 

Plant transformation. pATPAC-OE was introduced 
10 into Agrobacterium tumefaciens strain by a direct 
transformation method. Agrobacteri am-mediated 
transformation was performed using vacuum infiltration 
(Bechtold et al., 1993,. CR Acad. Sci . [Ill] 316: 
1194-1199.) 

15 Tl plants which survived on 

kanamycin-containing plates were selected, transplanted 
into soil and grown to set T2 seed. T3 seeds were 
collected from kanamycin-resistant T2 plants. T3 plants 
which showed 100% kanamycin-resistance were selected and 

20 were considered homozygous for the transgene. 

Antisense Plants. The full length cDNA in 
pBluescript SK(-) vector (Stratagene, CA) is digested 
with EcoRI (there is a cleavage site in the upstream 

25 polylinker) and Sspl. The resulting 1.3 Kb fragment 

representing a 5* portion of the AtPAC cDNA was cloned 
into the aforementioned pPZP-PCGN, which had been 
digested with EcoRI/ Smal, ensuring that this fragment of 
the cDNA was inserted in the antisense orientation. This 

30 construct was named pATPAC-AE . pATPAOAE was introduced 
into Arabidopsis plants by Agrobacterium transformation, 
as described above. 
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Knock-out Plants. The method of Krysan et al 
(1996, PNAS 93:8145, incorporated by reference herein) 
was followed using the following primers: 
Gene-specific primers: 

AtpacF; CACTGCTCAATGATCTCGTTTTCTCACTA (SEQ ID NO: 11) 
AtpacR: C T T GAAT CACACCAATGCAATCAACACCT C (SEQ ID NO: 12) 
Primers for T-DNA left boarder: 

JL202: CATTTTATAATAACGCTGCGGACATCTAC (SEQ ID NO: 13) 
JL270: TTTCTCCATATTGACCATCATACTCATTG (SEQ ID NO: 14) 

While certain of the preferred embodiments of 
the present invention have been described and 
specifically exemplified above, it is not intended that 
the invention be limited to such embodiments. Various 
modifications may be made thereto without departing from 
the scope and spirit of the present invention, as set 
forth in the following claims. 
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What is claimed: 

1. A nucleic acid isolated from a plant, which 
encodes a p-glycoprotein that is inducible by exposure of 
5 the plant to NPPB. 



2. The isolated nucleic acid of claim 1, which . 
is preferentially expressed in plant roots upon exposure 
of the plant to NPPB. 

10 

3. The isolated nucleic acid of claim 1, 
wherein the plant is selected from the group consisting 
of Brassica napus and Arabidopsis thaliana and is 3850- 
4150 nucleotides long. 

15 

4. The isolated nucleic acid of claim 1, which 
has the restriction sites shown in Figure 4 for at least 
three enzymes. 

20 5. The isolated nucleic acid of claim 4, which 

encodes a polypeptide having SEQ ID NO: 2. 

6. The isolated nucleic acid of claim 5, which 
is a cDNA comprising a coding region selected from the 

25 group consisting of SEQ ID NO:l and SEQ ID NO: 10. 

7. An isolated protein, which is a product of 
expression of part or all of the isolated nucleic acid 
molecule of claim 1. 

30 

8. Antibodies immunologically specific for the 
protein of claim 7. 



9. A expression cassette, which comprises a 
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plPAC gene coding sequence operably linked to a promoter. 



10. The expression cassette of claim 9, which 
comprises a plPAC gene from Arabidopsis thaliana. 

5 

11. The expression cassette of claim 10, in 
which the promoter is the cauliflower mosaic virus 35S 
promoter. 

10 12. The expression cassette of claim 10, in 

which the plPAC gene is part or all of SEQ ID N0:1 or SEQ 
ID NO: 10. 

13. A vector comprising the expression 
15 cassette of claim 9. 

14. The vector of claim 13, which is comprised 
of an Agrobacterium binary vector selected from the group 
consisting of pPZP211 and pCGN7366. 

20 

15. A method for producing a plant with 
enhanced resistance to xenobiotic compounds by 
transforming in vitro the plant with the expression 
cassette of claim 9. 

25 

16. The method of claim 15, wherein the 
transformation step further uses the vector of claim 13. 

17. A transgenic plant produced by the method 
30 of claim 15. 

18. A reproductive unit form the transgenic 
plant of claim 17. 



WO 00/18886 PCT/US99/22363 

- 35 - 

19. A cell from the transgenic plant of claim 

17. 

20. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 1, operably linked to a 
vector for transforming cells. 

21. A cell transformed with the recombinant 
DNA molecule of claim 20. 

22. The cell of claim 21, selected from the 
group consisting of bacterial cells, yeast cells and 
plant cells. 

23. A transgenic plant regenerated from the 
transformed cell of claim 22. 

24. An isolated nucleic acid molecule of at 
least 20 nucleotides in length having a sequence selected 
from the group consisting of: 

a) SEQ ID N0:1 and SEQ ID NO: 10; 

b) a nucleic acid sequence that is at least 
about 60% homologous to the coding regions of SEQ ID NO:l 
or SEQ ID NO: 10; 

c) a sequence hybridizing with SEQ ID NO:l or 
SEQ ID NO: 10 at moderate stringency; 

d) a sequence encoding part or all of a 
polypeptide having SEQ ID NO: 2; 

e) a sequence encoding an amino acid sequence 
that is at least about 70% identical to SEQ ID NO:2; 

f) a sequence encoding an amino acid sequence 
that is at least about 80% similar to SEQ ID NO: 2; 

g) a sequence encoding an amino acid sequence 
that is at least about 40% similar to residues 1-76, 613- 



5 
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669 or 1144-1161 of SEQ ID NO:2; and 

h) a sequence hybridizing at moderate 
stringency to a sequence encoding residues 1-76, 613-669 
or 1144-1161 of SEQ ID NO:2. 

25. A polypeptide produced by expression of the 
nucleic acid sequence of claim 24. 

26. Antibodies immunologically specific for the 
10 polypeptide of claim 24. 

27. An oligonucleotide between about 10 and 
about 100 nucleotides in length, which specifically 
hybridizes at moderate stringency with a portion of the 

15 nucleic acid molecule of claim 24. 

28. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 24, operably linked to a 
vector for transforming cells. 

20 

29. A cell transformed with the recombinant DNA 
molecule of claim 28. 

30. The cell of claim 29, selected from the 

25 group consisting of bacterial cells, yeast cells and plant 
cells . 

31. A transgenic plant regenerated from the 
cell of claim 30. 



30 



32. An isolated plant p-glycoprotein, which is 
inducible upon exposure of the plant to NPPB. 



33. The p-glycoprotein of claim 32, which 
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confers upon a cell in which it is found resistance to 
Rhodamine 6G. 

34. The p-glycoprotein of claim 33, which is 

5 preferentially produced in roots upon the exposure to the 
NPPB. 

35. The p-glycoprotein of claim 34, from a 
plant selected from the group consisting of Brassica napus 

10 and Arabidopsis thaliana. 

36. The p-glycoprotein of claim 35, having an 
amino acid sequence that selected from the group 
consisting of: 

15 a) an amino acid sequence that is at least 80% 

similar to SEQ ID NO: 2; 

b) an amino acid sequence that is at least 70% 
identical to SEQ ID NO: 2; 

c) an amino acid sequence that is at least 40% 
20 similar to residues 1-76, 613-669 or 1144-1161 of SEQ ID 

NO: 2; and 

d) an amino acid sequence encoded by a nucleic 
acid sequence hybridizing at moderate stringency to a 
amino acid sequence encoding residues 1-76, 613-669 or 

25 1144-1161 of SEQ ID NO:2. 

37. Antibodies immunologically specific for the 
p-glycoprotein of claim 32. 

30 38 The antibodies of claim 35, that are 

immunologically specific to residues 1-76, 613-669 or 
1144-1161 of SEQ ID NO:2. 

39. A plant p-glycoprotein gene promoter which 
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40. The plant p-glycoprotein gene promoter of 
claim 39, that is part or all of residues 1-3429 of SEQ ID 
5 NO: 10. 



41. A plant with reduced levels of plPAC 

. protein. 

10 42. The plant of claim 41, wherein the native 

plPAC gene is mutated. 

43. The plant of claim 42, wherein the plPAC 
gene is mutated due to the insertion of a T-DNA. 



15 



44. A method for making the plant of claim 42, 
wherein a population of mutated plants are screened using 
at least one of SEQ ID NOS: 11-14 as PCR primers. 



20 45. The method of claim 44, wherein the 

population of plants is mutated by T-DNA insertion. 
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1. 



SEQUENCE LISTING 

<110> Wisconsin Alumni Research Foundation 
Spalding, Edgar P. 
Noh, Bosl 

<120> Xenobiotic Detoxification Gene from 
Plants* 

<130> WARF S212 

<150> 60/101,814 
<151> 1998-09-25 

<160> 14 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 4051 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> mis cofeature 

<222> (94) ... (0) 

<223> Translation start codon 

<221> misc_feature 
<222> (3932) ... (0) 
<223> Stop codon 

<400> 1 

cttgaacttc acaaaacaat tgtcagattt tcaagaaaaa ctttataaaa caaaaaacat 60 

ttcattcttt ctctctctct ctctcactgc tcaatgatct cgttttctca ctaaaccaac 120 

tcgtttcttc ttactttctt taactcggat ctacaaaaaa ccatgtcgga aactaacaca 180 

accgatgcca agactgttcc agcagaagca gagaagaaga aagaacagag tttaccattc 240 

tttaaactct tttctttcgc tgataaattt gattatctct taatgttcgt tggttctctt 300 

ggtgccattg ttcatggctc ttccatgcct gtcttctttt tactctttgg tcaaatggtt 360 

aatggatttg gtaaaaacca aatggattta catcaaatgg ttcatgaagt ctctagatat 420 

tctctatatt tcgtctactt gggtttggtc gtttgcttct cttcttacgc agagatagca 4 80 

tgttggatgt attctggaga aagacaagta gcagcattaa ggaagaaata tcttgaagca 540 

gtattaaaac aagacgttgg gttctttgat actgatgcta gaactggtga cattgtcttt 600 

agtgtttcta ctgatactct tcttgttcaa gatgccatta gtgaaaaggt tggaaacttt 660 

atacattacc tctcaacatt tttggcggga ttagtagttg gatttgtatc agcatggaaa 720 

ttagctttgt taagtgttgc tgtgattccc ggaatcgctt tcgccggagg tttatacgct 780 

tatacactca ccggaattac ttcaaagagc cgtgaatctt atgctaacgc cggtgttatc 840 

gccgagcagg caattgctca agttcgaact gtttattctt atgttggaga gagtaaggca 900 

cttaatgcgt attcggatgc gattcagtat acgcttaagc tcggttataa agcggggatg 960 

gctaaagggt tgggtttagg atgtacttat ggaatagctt gtatgtcatg ggctttggtg 1020 

ttttggtatg ctggagtttt tattcggaat ggacaaaccg atggaggaaa ggcgtttact 1080 

gctatattct ctgctattgt tggtggaatg agtttggggc aatctttctc gaatcttggg 1140 

gcgtttagta aaggtaaagc ggctggttat aagttgatgg agataattaa ccagagaccg 1200 

acgataattc aagacccgtt ggatggaaaa tgtttggatc aagttcatgg gaacattgag 1260 

tttaaagatg tgacttttag ctatccttca cggcctgatg ttatgatctt caggaacttt 1320 

aatattttct tcccttctgg gaaaactgtg gcggttgttg gtgggagtgg ctctggaaag 1380 

agtactgttg tttccctcat tgagagattc tatgatccaa acagcgggca aattctgttg 1440 

gatggtgttg agataaagac gcttcagttg aagtttttgc gtgaacaaat cgggcttgtg 1500 

aatcaagaac ctgcgctctt tgccactact atactagaga acatactcta tggaaagcct 1560 

gatgcaacaa tggttgaagt tgaagctgct gcttccgctg cgaatgcgca tagtttcatt 1620 

acattacttc ctaaaggcta cgacacacag gttggagaac gtggtgttca actctcaggt 1680 
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ggacagaagc agagaattgc aattgctagg gcgatgttga aagacccaaa gattctgtta 1740 

ctagatgaag ctacaagcgc tcttgatgct agctctgaga gcattgttca ggaagcttta 1800 

gacagagtca tggtggggag gaccactgtt gttgttgctc atcgtctctg caccatcaga 1860 

aatgttgatt ccattgccgt gatacagcaa ggccaagttg ttgaaaccgg aacacatgaa 1920 

gaactcattg ccaaatccgg tgcttacgca tccctcatca ggtttcagga aatggttggt 198 0 

actcgagatt tctcaaaccc gtcaactcgt cgcactcgtt caacccgttt gagccattca 204 0 

ctgtcaacga aatcactcag tttaagatca ggaagtttga ggaatctgag ctattcttac 2100 

agcactggag ctgatggtcg gatagagatg atttcaaatg cagagactga ccgaaagact 2160 

cgtgcccctg aaaattactt ctacaggctt ctcaagctta attcaccgga atggccttac 2220 

tcaatcatgg gagcagtagg ctcaattctt tctggtttca ttggtcctac atttgctatt 2280 

gtgatgagca acatgatcga agtcttctac tacacagact atgattcaat ggaaaggaaa 2340 

acaaaagagt atgtcttcat ctacattggt gctggtctct atgctgtggg tgcttatttg 2400 - 

atccaacatt acttctttag catcatggga gaaaacctca caacaagagt aagaagaatg 2460 

atgctctcag ctatcttgag aaacgaagtt ggttggttcg atgaggatga acacaactca 2520 

agcctgatcg ctgcacgttt agctactgat gcagcagatg ttaaatccgc tatagccgag 2580 

agaatctcag taattctaca aaacatgact tcacttctca catccttcat agtcgccttc 2640 

atagtagaat ggagagtctc acttctcatc ttaggcacat tcccacttct agtcctcgct 2700 

aactttgctc agcaactatc tctgaagggt tttgctggag acacagctaa ggctcatgca 2760 

aagacttcaa tgattgctgg tgaaggagtc agtaacatta gaaccgtagc agctttcaat 2820 

gcacagagca agattctctc tttgttctgt catgagcttc gtgtacctca gaaaagaagc . 2880 

ttaagcttat accgaagtca aacctcgggt ttcctatttg gcctctcgca gcttgctctc 2940 

tatggttctg aggctttaat tctctggtat ggtgcccacc ttgtgagtaa aggcgtgtca 3000 

accttttcca aagtgatcaa agtgtttgtg gttttggtca ttactgcaaa ctctgttgct 3060 

gaaactgtca gtcttgctcc tgaaattatt cggggaggtg aagctgttgg ttcggttttc 3120 

tcggtcttgg acaggcagac caggattgac ccggatgatg ctgatgctga tcccgtggag 3180 

acgatccgtg gagacattga gtttaggcat gttgatttcg cttacccttc aagacccgac 3240 

gtcatggttt tcagggactt taacctcaga attcgagctg gacatagcca agctcttgtg 3300 

ggcgcgagtg ggtcagggaa gagttctgta attgcgatga tcgagcggtt ttacgacctt 3360 

cttgctggaa aagtcatgat tgatggcaaa gacatccgcc ggctaaacct gaaatctcta 3420 

aggctcaaaa tcggtcttgt tcaacaagaa ccagctcttt tcgcagcaac gatcttcgac 34 80 

aacatcgcct atggtaaaga tggtgcaact gaatccgagg taattgatgc agctcgagcc 3540 

gcaaatgctc acggtttcat cagtggttta cctgaaggtt acaaaactcc agtaggcgaa 3600 

agaggagtgc agttatcagg tggacagaaa cagaggatcg cgatagcaag agctgtgctc 3 660 

aagaacccta cagtgttgct tctagacgaa gcaactagcg cactagatgc agaatcagaa 3720 

tgcgtgctgc aagaggcgtt agagaggctc atgagaggtc ggaccaccgt ggtagttgct 3780 

caccgcttgt ccaccataag aggtgttgat tgcattggtg tgattcaaga cgggcggatt 3840 

gtggagcaag gcagccattc agagctcgtt agccgaccag agggagctta ttcaaggctg 3900 

ttacagcttc aaacacatag gatttgaagc ttgatcatgg attaaaaaca aaaaatcggt 3960 

ttgtgtaatt ttttttatat taaaacttta atttggaaga tttctatgga ctataacgat 4 020 

aatatgaata ggtgtagata atgaagcttt t 4051 

<210> 2 
<211> 1254 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ser Glu Thr Asn Thr Thr Asp Ala Lys Thr Val Pro Ala Glu Ala 

15 10 15 

Glu Lys Lys Lys Glu Gin Ser Leu Pro Phe Phe Lys Leu Phe Ser Phe 

20 25 30 

Ala Asp Lys Phe Asp Tyr Leu Leu Met Phe Val Gly Ser Leu Gly Ala 

35 * 4 0 45 

He Val His Gly Ser Ser Met Pro Val Phe Phe Leu Leu Phe Gly Gin 

50 55 60 

Met Val Asn Gly Phe Gly Lys Asn Gin Met Asp Leu His Gin Met Val 
65 " 70 75 80 

His Glu Val Ser Arg Tyr Ser Leu Tyr Phe Val Tyr Leu Gly Leu Val 

85 ^ 90 95 

Val Cys Phe Ser Ser Tyr Ala Glu He Ala Cys Trp Met Tyr Ser Gly 

100 105 110 

Glu Arg Gin Val Ala Ala Leu Arg Lys Lys Tyr Leu Glu Ala Val Leu 
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115 










120 


Lvs 


Gin 


Asp 


Val 


Gly 


Phe 


Phe Asp 


130 










135 




Val 


Phe 


Ser 


Val 


Ser 


Thr Asp Thr 


145 










150 






Glu 


Lys 


Val 


Gly Asn 


Phe 


He 


His 








165 








Leu 


Val 


Val 


Gly Phe 


Val 


Ser 


Ala 








180 










Ala 


Val 


He 


Pro 


Gly 


He 


Ala 


Phe 






195 










200 


Leu 


Thr 


Glv 


He 


Thr 


Ser 


Lys 


Ser 




210 










215 




Val 


lie 


Ala 


Glu 


Gin 


Ala 


He 


Ala 












230 






Val 


Glv 


Glu 


Ser 


Lys 


Ala 


Leu 


Asn 








245 








Thr 


Leu 


Lys 


Leu Gly 


Tyr 


Lys 


Ala 








260 










Gly 


Cys 


Thr 


Tyr Gly 


He 


Ala 


Cys 
















280 


Tyr 


Ala 


Gly 


Val 


Phe 


He Arg Asn 


290 










295 




Phe 


Thr 


ZXl ^ 
r\J- d 


He 


Phe 


Ser 


Ala 


He 


305 










310 






Ser 


Phe 


Ser 


Asn 


Leu 


Gly Ala 


Phe 










325 








Lys 


Leu 


Met 


Glu 


He 


He 


Asn 


Gin 






340 










Leu Asp 


Gly 


Lys 


Cys 


Leu Asp 


Gin 






355 










360 


Asp 


Val 


Thr 


Phe 


Ser 


Tyr 


Pro 


Ser 


370 










375 




Asn 


Phe 


Asn 


He 


Phe 


Phe 


Pro 


Ser 


385 










390 






Gly 


Ser 


Gly 


Ser Gly 


Lys 


Ser 


Thr 










405 








Tyr Asp 


Pro 


Asn 


Ser 


Gly Gin lie 








420 










Thr 


Leu 


Gin 


Leu 


Lys 


Phe 


Leu Arg 






435 










440 


Glu 


Pro 


Ala 


Leu 


Phe 


Ala 


Thr 


Thr 




450 










455 




Lys 


Pro 


Asd 


Ala 


Thr 


Met 


Val 


Glu 


465 










470 






Asn 


Ala 


His 


Ser 


Phe 


He 


Thr 


Leu 










485 








Val 


Gly 


Glu 


Arg Gly 


Val 


Gin 


Leu 








500 










Ala 


He 


Ala 


Arg Ala 


Met 


Leu 


Lys 






515 










520 


Glu 


Ala 


Thr 


Ser 


Ala 


Leu 


Asp 


Ala 




530 










535 




Ala 


Leu 


Asp 


Arg 


Val 


Met 


Val 


Gly 


545 










550 






Arg 


Leu 


Cys 


Thr 


He 


Arg 


Asn 


Val 










565 








Gly 


Gin 


Val 


Val 


Glu 


Thr Gly Thr 






580 










Gly Ala 


Tyr 


Ala 


Ser 


Leu 


He Arg 






595 










600 


Asp 


Phe 


Ser 


Asn 


Pro 


Ser Thr Arg 



3 



125 



Thr 


Asp 


Ala 


Arg 


Thr 


Gly 


Asp 


lie 








140 










Leu 


Leu 


Val 


Gin 


Asp 


Ala 


lie 


Ser 






155 










160 


Tyr 


Leu 


Ser 


Thr 


Phe 


Leu 


Ala 


Gly 


170 










175 




Trp 


Lys 


Leu 


Ala 


Leu 


Leu 


Ser 


Val 


185 








190 






Ala 


Gly 


Gly 


Leu 


Tyr 


Ala 


Tyr 


Thr 










205 








Arcr 


Glu 


Ser 


Tyr 


Ala 


Asn 


Ala 


Gly 








220 










Gin 


Val 


Arg 


Thr 


Val 


Tyr 


Ser 


Tyr 






235 










240 


Ala 


Tvr 


Ser 


Asp 


Ala 


He 


Gin 


Tyr 




250 










255 




Glv 


Met 


Ala 


Lys 


Gly 


Leu 


Gly 


Leu 


265 










270 






Met 


Ser 


TrD 


Ala 


Leu 


Val 


Phe 


Trp 

It 










285 








Glv 


Gin 


Thr 


Asp 


Glv 


Gly 


Lvs 


Ala 








300 










Val 


Glv 


Glv 


Met 


Ser 


Leu 


Gly 


Gin 






315 










320 


Ser 


Lvs 


Glv 


Lvs 


Ala 


Ala 


Gly 


Tyr 




330 










335 




Arg 


Pro 


Thr 


He 


He 


Gin 


Asp 


Pro 


345 










350 






Val 


His 


Glv 


Asn 


He 


Glu 


Phe 


Lys 










365 








Arg 


Pro 


Asp 


Val 


Met 


He 


Phe 


Arg 








380 










Glv 


Lvs 


Thr 


Val 


Ala 


Val 


Val 


Gly 






395 










400 


Val 


Val 


Ser 


Leu 


He 


Glu 


Arg 


Phe 




410 










415 




Leu 


Leu 


Asp 


Gly 


Val 


Glu 


lie 


Lys 


425 










430 






Glu 


Gin 


He 


Gly 


Leu 


Val 


Asn 


Gin 








445 








He 


Leu 


Glu 


Asn 


He 


Leu 


Tyr 


Gly 








460 










Val 


Glu 


Ala 


Ala 


Ala 


Ser 


Ala 


Ala 






475 










480 


Leu 


Pro 


Lys 


Gly 


Tyr 


Asp 


Thr 


Gin 




490 










495 




Ser 


Gly 


Gly 


Gin 


Lys 


Gin 


Arg 


lie 


505 










510 






Asp 


Pro 


Lvs 


He 


Leu 


Leu 


Leu 


Asp 








525 








Ser 


Ser 


Glu 


Ser 


He 


Val 


Gin 


Glu 








540 










Arg 


Thr 


Thr 


Val 


Val 


Val 


Ala 


His 




555 










560 


Asp 


Ser 


He 


Ala 


Val 


He 


Gin 


Gin 


t n r> 
«J / u 










575 




His 


Glu 


Glu 


Leu 


He 


Ala 


Lys 


Ser 


585 










590 






Phe 


Gin 


Glu 


Met 


Val 


Gly 


Thr 


Arg 










605 








Arg 


Thr 


Arg 


Ser 


Thr 


Arg 


Leu 


Ser 
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610 








615 










620 








His 


Ser 


Leu 


Ser Thr Lys 


Ser 


Leu 


Ser 


Leu 


Arg 


Ser 


Gly 


Ser Leu 


Arg 


625 








630 










635 






He Glu 


640 


Asn 


Leu 


Ser 


Tyr 


Ser Tyr 


Ser 


Thr 


Gly 


Ala 


Asp 


Gly 


Arg 


Met 








645 








650 








655 




lie 


Ser 


Asn 


Ala 


Glu Thr 


Asp 


Arg 


Lys 


Thr 


Arg 


Ala 


Pro 


Glu Asn 


Tyr 








660 








665 










670 




Phe Tyr 


Ara 


Leu 


Leu Lys 


Leu 


Asn 


Ser 


Pro 


Glu 


Trp 


Pro 


Tyr Ser 


lie 






675 








680 










685 






Met 


Gly 


Ala 


Val 


Gly Ser 


lie 


Leu 


Ser 


Gly 


Phe 


lie 


Gly 


Pro Thr 


Phe 




690 








695 










700 








Ala 


lie 


Val 


Met 


Ser Asn 


Met 


He 


Glu 


Val 


Phe 


Tyr 


Tyr 


Thr Asp 


Tyr 


705 








710 










715 








720 


Asp 


Ser 


Met 


Glu Arg Lys 


Thr 


Lys 


Glu 


Tyr 


Val 


Phe 


He 


Tyr He 


Gly 








725 








730 








735 




Ala 


Gly 


Leu 


Tyr Ala Val 


Glv 
j 


Ala 


Tvr 
j 


Leu 


He 


Gin 


His 


Tyr Phe 


Phe 






740 








745 










750 




Ser 


lie 


Met 


Gly Glu Asn 


Leu 


Thr 


Thr 


Ara 


Val 


Arg 


Arg 


Met Met 


Leu 














760 










765 






Ser 


Ala 


He 


Leu Arg Asn 


Glu 


Val 


Glv 


Trp 


Phe 


Asp 


Glu 


Asp Glu 


His 




770 








775 










780 








Asn 


Ser 


Ser 


Leu 


He Ala 


Ala 


Arg 


Leu 


Ala 


Thr 


Asp 


Ala 


Ala Asp 


Val 


785 








790 










795 








800 


Lys 


Ser 




He 


Ala Glu 


Arg 


He 


Ser 


Val 


He 


Leu 


Gin 


Asn Met 


Thr 








805 








810 








815 




Ser 


Leu 


Leu 


Thr 


Ser Phe 


He 


Val 


Ala 


Phe 


He 


Val 


Glu 


Trp Arg 


Val 








820 








825 










830 




Ser 


Leu 


Leu 


He 


Leu Gly 


Thr 


Phe 


Pro 


Leu 


Leu 


Val 


Leu 


Ala Asn 


Phe 






835 






840 










845 






Ala 


Gin 


Gin 


Leu 


Ser Leu 


Lvs 


Gly 


Phe 


Ala 


Gly 


Asp 


Thr 


Ala Lys 


Ala 




850 








855 










860 








His 


Ala 


Lys 


Thr 


Ser Met 


He 


Ala 


Glv 


Glu 


Gly 


Val 


Ser 


Asn He 


Arg 


865 






870 










875 








880 


Thr 


Val 


Ala 


Ala 


Phe Asn 


Ala 


Gin 


Ser 


Lvs 


He 


Leu 


Ser 


Leu Phe 


Cys 










885 








890 








895 




His 


Glu 


Leu 


Arg Val Pro 


Gin 


Lys 


Arg 


Ser 


Leu 


Ser 


Leu 


Tyr Arg 


Ser 








900 








905 










910 




Gin 


Thr 


Ser 


Gly Phe Leu 


Phe 


Gly 


Leu 


Ser 


Gin 


Leu 


Ala 


Leu Tyr 


Gly 






915 








920 










925 






Ser 


Glu 


Ala 


Leu 


He Leu 


Trp 


Tyr 


Gly 


Ala 


His 


Leu 


Val 


Ser Lys 


Gly 




930 








935 










940 








Val 


Ser 


Thr 


Phe 


Ser Lys 


Val 


He 


Lys 


Val 


Phe 


Val 


Val 


Leu Val 


He 


945 








950 










955 








960 


Thr 


Ala 


Asn 


Ser 


Val Ala 


Glu 


Thr 


Val 


Ser 


Leu 


Ala 


Pro 


Glu He 


He 










965 








970 








975 




Arg 


Gly 


Gly 


Glu 


Ala Val 


Gly 


Ser 


Val 


Phe 


Ser 


Val 


Leu 


Asp Arg 


Gin 




980 








985 










990 




Thr Arg 


He 


Asp 


Pro Asp 


Asp 


Ala 


Asp 


Ala 


Asp 


Pro 


Val 


Glu Thr 


He 






995 






1000 








1005 




Arg 


Gly 


Asp 


He 


Glu Phe 


Arg 


His 


Val 


Asp 


Phe 


Ala 


Tyr 


Pro Ser 


Arg 


1010 






1015 








1020 






Pro 


Asp 

5 


Val 


Met 


Val Phe 


Arg Asp 


Phe 


Asn 


Leu 


Arg 


He Arg Ala 


Gly 


1021 






1030 








1035 






1040 


His 


Ser 


Gin 


Ala 


Leu Val 


Gly Ala 


Ser 


Gly 


Ser 


Gly 


Lys 


Ser Ser 


Val 










1045 








1050 






1055 


lie 


Ala 


Met 


He 


Glu Arg 


Phe 


Tyr Asp 


Leu 


Leu 


Ala 


Gly 


Lys Val 


Met 








1060 






1065 








107 0 




lie Asp Gly 


Lys Asp He 


Arg Arg 


Leu 


Asn 


Leu 


Lys 


Ser 


Leu Arg 


Leu 






1075 






1080 








1085 




Lys 


He Gly 


Leu 


Val Gin 


Gin 


Glu 


Pro 


Ala 


Leu 


Phe 


Ala 


Ala Thr 


He 


1090 






1095 








1100 






Phe Asp Asn lie Ala Tyr Gly Lys Asp Gly Ala 


Thr 


Glu 


Ser Glu 


Val 
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1105 1110 1115 1120 

He Asp Ala Ala Arg Ala Ala Asn Ala His Gly Phe He Ser Gly Leu 

1125 1130 1135 

Pro Glu Gly Tyr Lys Thr Pro Val Gly Glu Arg Gly Val Gin Leu Ser 

1140 1145 1150 

Gly Gly Gin Lys Gin Arg He Ala He Ala Arg Ala Val Leu Lys Asn 

1155 1160 1165 

Pro Thr Val Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu 

1170 1175 1180 

Ser Glu Cys Val Leu Gin Glu Ala Leu Glu Arg Leu Met Arg Gly Arg 
1185 "* 1190 1195 1200 

Thr Thr Val Val Val Ala His Arg Leu Ser Thr He Arg Gly Val Asp 

1205 ' 1210 1215 

Cys He Gly Val lie Gin Asp Gly Arg lie Val Glu Gin Gly Ser His 

1220 1225 1230 

Ser Glu Leu Val Ser Arg Pro Glu Gly Ala Tyr Ser Arg Leu Leu Gin 

1235 1240 1245 

Leu Gin Thr His Arg He 
1250 

<210> 3 
<211> 1280 
<212> PRT 

<213> Homo sapiens 
<300> 

<308> Genbank P08183 
<309> 1997-11-01 



<400> 3 



Met 


Asp 


Leu 


Glu 


Gly 


Asp 


Arg 


Asn 


Gly Gly Ala 


Lys Lys 


Lys 


Asn 


Phe 


1 






5 








10 






15 




Phe 


Lys 


Leu 


Asn 


Asn 


Lys 


Ser 


Glu 


Lys Asp Lys 


Lys Glu 


Lys 


Lys 


Pro 






20 










25 




30 






Thr 


Val 


Ser 
35 


Val 


Phe 


Ser 


Met 


Phe 
40 


Arg Tyr Ser 


Asn Trp 
45 


Leu 


Asp 


Lys 


Leu 


Tyr 


Met 


Val 


Val 


Gly 


Thr 


Leu 


Ala Ala lie 


lie His 


Gly 


Ala 


Gly 




50 








55 






60 








Leu 


Pro 


Leu 


Met 


Met 


Leu 


Val 


Phe 


Gly Glu Met 


Thr Asp 


lie 


Phe 


Ala 


65 










70 






75 








80 


Asn 


Ala 


Gly 


Asn 


Leu 


Glu 


Asp 


Leu 


Met Ser Asn 


lie Thr 


Asn 


Arg 


Ser 








85 








90 






95 




Asp 


He 


Asn 


Asp 


Thr 


Gly 


Phe 


Phe 


Met Asn Leu 


Glu Glu 


Asp 


Met 


Thr 






100 










105 




110 






Arg 


Tyr 


Ala 
115 


Tyr 


Tyr 


Tyr 


Ser 


Gly 
120 


lie Gly Ala 


Gly Val 
125 


Leu 


Val 


Ala 


Ala 


Tyr 
130 


lie 


Gin 


Val 


Ser 


Phe 
135 


Trp 


Cys Leu Ala 


Ala Gly 
140 


Arg 


Gin 


lie 


His 


Lys 


lie 


Arg 


Lys 


Gin 


Phe 


Phe 


His Ala He 


Met Arg 


Gin 


Glu 


lie 


145 






150 






155 








160 


Gly 


Trp 


Phe 


Asp 


Val 


His 


Asp 


Val 


Gly Glu Leu 


Asn Thr 


Arg 


Leu 


Thr 








165 








170 






175 




Asp 


Asp 


Val 


Ser 
180 


Lys 


lie 


Asn 


Glu 


Val lie Gly 
185 


Asp Lys 


lie 
190 


Gly 


Met 


Phe 


Phe 


Gin 
195 


Ser 


Met 


Ala 


Thr 


Phe 
200 


Phe Thr Gly 


Phe lie 
205 


Val 


Gly 


Phe 


Thr 


Arg 
210 


Gly 


Trp 


Lys 


Leu 


Thr 

215 


Leu 


Val He Leu 


Ala He 
220 


Ser 


Pro 


Val 


Leu 


Gly 


Leu 


Ser 


Ala 


Ala 


Val 


Trp 


Ala Lys He 


Leu Ser 


Ser 


Phe 


Thr 


225 








230 






235 








240 


Asp 


Lys 


Glu 


Leu 


Leu 


Ala 


Tyr 


Ala 


Lys Ala Gly 


Ala Val 


Ala 


Glu 


Glu 






245 








250 






255 
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Val 


Leu 


Ala 


Ala 


He Arg Thr Val He 


Ala 








260 




265 




Glu 


Leu 


Glu 


Arg 


Tyr Asn Lys Asn Leu 


Glu 






275 






280 




He 


Lys 


Lys 


Ala 


He 


Thr Ala Asn He 


Ser 




290 








295 




Leu 


He 


Tyr 


Ala 


Ser Tyr Ala Leu Ala 


Phe 


305 










310 




Val 


Leu 


Ser 


Gly 


Glu Tyr Ser He Gly 


Gin 










325 




330 


Ser 


Val 


Leu 


He 


Gly Ala Phe Ser Val 


Gly 








340 




345 




Glu 


Ala 


Phe 


Ala 


Asn Ala Arg Gly Ala 


Ala 






355 






360 




He Asp 


Asn 


Lys 


Pro 


Ser lie Asp Ser 


Tyr 




370 








375 




Pro Asp 


Asn 


He 


Lys 


Gly Asn Leu Glu 


Phe 


385 










390 




Tyr 


Pro 


Ser 


Arg 


Lys 


Glu Val Lys He 


Leu 








405 




410 


Val 


Gin 


Ser 


Gly 


Gin 


Thr Val Ala Leu 


Val 








420 




425 




Lys 


Ser 


Thr 


Thr 


Val 


Gin Leu Met Gin 


Arg 




435 






440 




Gly Met 


Val 


Ser 


Val 


Asp Gly Gin Asp 


lie 




450 








455 




Phe 


Leu 


Arg 


Glu 


He 


He Gly Val Val 


Ser 


465 










470 




Ala 


Thr 


Thr 


He 


Ala Glu Asn He Arg 


Tyr 










485 




490 


Met Asp 


Glu 


He 


Glu Lys Ala Val Lys 


Glu 








500 




505 




He 


Met 


Lys 


Leu 


Pro 


His Lys Phe Asp 


Thr 






515 






520 




Ala 


Gin 


Leu 


Ser 


Gly Gly Gin Lys Gin 


Arg 




530 








535 




Leu 


Val 


Arg 


Asn 


Pro 


Lys lie Leu Leu 


Leu 


545 










550 




Leu Asp 


Thr 


Glu 


Ser 


Glu Ala Val Val 


Gin 










565 




570 


Arg 


Lys 


Gly 


Arg 


Thr 


Thr He Val lie 


Ala 








580 




585 




Arg 


Asn 


Ala 


Asp 


Val 


He Ala Gly Phe 


Asp 






595 






600 




Lys 


Gly 


Asn 


His 


Asp 


Glu Leu Met Lys 


Glu 




610 








615 




Leu 


Val 


Thr 


Met 


Gin 


Thr Ala Gly Asn 


Glu 


625 










630 




Ala Asp 


Glu 


Ser 


Lys 


Ser Glu lie Asp 


Ala 










645 




650 


Asp 


Ser 


Arg 


Ser 


Ser 


Leu He Arg Lys 


Arg 








660 




665 




Arg Gly 


Ser 


Gin 


Ala 


Gin Asp Arg Lys 


Leu 






675 






680 




Asp 


Glu 


Ser 


He 


Pro 


Pro Val Ser Phe 


Trp 




690 








695 




Leu 


Thr 


Glu 


Trp 


Pro Tyr Phe Val Val 


r> 1 


705 










710 




Asn Gly 


Gly 


Leu 


Gin 


Pro Ala Phe Ala 


He 










725 




730 


Gly Val 


Phe 


Thr 


Arg 


He Asp Asp Pro 


Glu 








740 




745 
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Phe 


Gly 


Gly 


Gin 


Lys 


Lys 








270 






Glu 


Ala 


Lys 


Arg 


He 


Gly 






285 








lie 


Gly 


Ala 


Ala 


Phe 


Leu 




300 










Trp 


Tyr 


Gly 


Thr 


Thr 


Leu 


315 










320 


Val 


Leu 


Thr 


Val 


Phe 


Phe 










335 




Gin 


Ala 


Ser 


Pro 


Ser 


He 








350 






Tyr 


Glu 


lie 


Phe 


Lys 


He 






365 








Ser 


Lys 


Ser 


Gly 


His 


Lys 




380 










Arg 


Asn 


Val 


His 


Phe 


Ser 


395 










400 


Lys 


Gly 


Leu 


Asn 


Leu 


Lys 










415 




Gly 


Asn 


Ser 


Gly 


Cys 


Gly 








430 






Leu 


Tyr 


Asp 


Pro 


Thr 


Glu 






445 








Arg 


Thr 


lie 


Asn 


Val 


Arg 




460 










Gin 


Glu 


Pro 


Val 


Leu 


Phe 


475 










480 


Gly 


Arg 


Glu 


Asn 


Val 


Thr 










495 




Ala 


Asn 


Ala 


Tyr 


Asp 


Phe 








510 






Leu 


Val 


Gly 


Glu 


Arg 


Gly 






525 








He 


Ala 


lie 


Ala 


Arg 


Ala 




540 










Asp 


Glu 


Ala 


Thr 


Ser 


Ala 


555 










560 


Val 


Ala 


Leu 


Asp 


Lys 


Ala 










575 




His 


Arg 


Leu 


Ser 


Thr 


Val 






590 






Asp 


Gly 


Val 


lie 


Val 


Glu 






605 








Lys 


Gly 


lie 


Tyr 


Phe 


Lys 




620 










Val 


Glu 


Leu 


Glu 


Asn 


Ala 


635 










640 


Leu 


Glu 


Met 


Ser 


Ser 


Asn 










655 




Ser 


Thr 


Arg 


Arg 


Ser 


Val 








670 






Ser 


Thr 


Lys 


Glu 


Ala 


Leu 






685 








Arg 


lie 


Met 


Lys 


Leu 


Asn 




700 










Val 


Phe 


- 


Ala 


He 


lie 


715 










720 


lie 


Phe 


Ser 


Lys 


He 


He 










735 




Thr 


Lys 


Arg 


Gin 


Asn 


Ser 



750 
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Asn 


Leu 


Phe 
755 


Ser 


Leu 


Leu 


Phe 


Leu 
760 


Ala 


Leu 


Gly 


He 


He 
765 


Ser 


Phe 


He 


Thr 


Phe 
770 


Phe 


Leu 


Gin 


Gly 


Phe 
775 


Thr 


Phe 


Gly 


Lys 


Ala 

780 


Gly 


Glu 


He 


Leu 


1 llJL 


Lys 




Leu 


Arg 


Tvr 


Met 


Val 


Phe 


Arg 


Ser 


Met 


Leu 


Arg 


Gin 


Asp 


Ift^ 






790 










795 










800 


Val 


Ser 




Phe 


Asp 


Asp 


Pro 


Lys 


Asn 


Thr 


Thr 


Gly 


Ala 


Leu 


Thr 


Thr 








AOS 










810 










815 




Arg 


Leu 


Ala 


Asn 


Asp 


Ala 


Ala 


Gin 


Val 


Lys 


Gly 


Ala 


He 


Gly 


Ser 


Arg 






ft9 fl 










ft? ^ 

O Ct *J 










830 






Leu 


Ala 


Val 

Q "3 C 


He 


Thr 


Gin 


Asn 


He 
ft a n 

OH.V 


Ala 


Asn 


Leu 


Gly 


Thr 
845 


Gly 


He 


He 


He 


£>er 

OCA. 
ODV 


rne 


lie 


Tyr 


(11 v 


irp 

ft R 1 ^ 


Gin 


Leu 




Leu 


Leu 
860 


Leu 


Leu 


Ala 


lie 


Val 


Pro 


lie 


lie 


/\-L a 


Tip 

l ie 


Al a 


vj-L y 


V ctX 


Val 
v cij. 


Glu 


Met 


Lys 


Met 


Leu 


Ser 


one 
boo 










rt n 

0 / u 










875 










880 


Gly 


Gin 


Ala 


Leu 


Lys 


ASp 


Lys 


ijys 


o _i_ U 




^ -L Li 


Glv 


Ala 


Glv 


Lys 


He 








ope 










ft QD 
o y u 










895 




Ala 


Thr 


GlU 


Ala 

q a a 
y uu 


He 


Glu 


Asn 


Phe 


Arg 


Thr 


Val 


Val 


Ser 


Leu 
910 


Thr 


Gin 


GlU 


Gin 


Lys 

QIC 

y id 


rne 


Glu 


nis 




Tyr 
920 




ulll 


•JCI 




Gin 
925 


Val 


Pro 


Tvr 


Arg 


Asn 


Ser 


T mi 

jjeu 


Arg 


Lys 


AJ.a 


His 


He 


rue 


oxy 


Tl 

lie 


Thr 


Phe 


Ser 


Phe 


930 










935 










940 










Thr 


Gin 


Ala 


Me u 


jyier 


Tyr 


Phe 


Ser 


Tyr 


Al a 


o_l y 


Cys 


Phe 


Arg 


Phe 


Glv 


945 










950 










955 










960 


Ala 


Tyr 


Leu 


Val 


Ala 


His 


Lys 


Leu 


Met 


Ser 


Phe 


Glu 


Asp 


Val 


Leu 


Leu 








965 










970 










975 




Val 


Phe 


Ser 


Ala 

q q a 
you 


Val 


Val 


Phe 


Gly 


Ala 
985 


Met 


Ala 


Val 


Gly 


Gin 
990 


Val 


Ser 


Ser 


Phe 


Ala 


Pro 


Asp 


Tyr 


Ala 


Lys 


Ala 


Lys 


lie 


Ser 


Ala 


Ala 


His 


He 






995 








1000 








1005 






He 


Met 


He 


He 


Glu 


Lys 


Thr 


Pro 


Leu 


He 


Asp 


Ser 


Tyr 


Ser 


Thr 


Glu 




1010 








1015 








1020 








Gly 


Leu 


Met 


Pro 


Asn 


Thr 


Leu 


Glu 


Gly 


Asn 


Val 


Thr 


Phe Gly 


Glu 


Val 


1025 








1030 








1035 








1040 


Val 


Phe 


Asn 


Tyr 


Pro 


Thr Arg 


Pro Asp 


He 


Pro 


Val 


Leu 


Gin 


Gly 


Leu 








1045 








1050 








1055 


Ser 


Leu 


Glu 


Val 


Lys 


Lys 


Gly Gin Thr 


Leu 


Ala 


Leu 


Val 


Gly 


Ser 


Ser 








106( 








1065 








1070 




Gly Cys 


Gly 


Lys 


Ser 


Thr 


Val 


Val 


Gin 


Leu 


Leu 


Glu Arg 


Phe 


Tyr Asp 



1075 1080 1085 



Pro Leu Ala Gly Lys Val Leu Leu Asp Gly Lys Glu He Lys Arg Leu 

1090 ' 1095 1100 

Asn Val Gin Trp Leu Arg Ala His Leu Gly lie Val Ser Gin Glu Pro 
1105 * 1110 1115 1120 

He Leu Phe Asp Cys Ser lie Ala Glu Asn lie Ala Tyr Gly Asp Asn 

1125 1130 1135 

Ser Arg Val Val Ser Gin Glu Glu lie Val Arg Ala Ala Lys Glu Ala 

1140 1145 1150 

Asn lie His Ala Phe lie Glu Ser Leu Pro Asn Lys Tyr Ser Thr Lys 

1155 1160 1165 

Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin Arg He 

1170 " 1175 1180 

Ala lie Ala Arg Ala Leu Val Arg Gin Pro His He Leu Leu Leu Asp 
1185 1190 1195 1200 

Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val Gin Glu 

1205 1210 _ 1215 

Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys lie Val lie Ala His 

1220 1225 1230 

Arg Leu Ser Thr lie Gin Asn Ala Asp Leu lie Val Val Phe Gin Asn 
1235 1240 1245 
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Gly Arg Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala Gin Lys 

1250 1255 1260 

Gly He Tyr Phe Ser Met Val Ser Val Gin Ala Gly Thr Lys Arg Gin 
1265 ' 1270 1275 1280 

<210> 4 
<211> 1175 
<212> PRT 

<213> Mus musculus 



<300> 

<308> Genbank P06795 
<309> 1998-07-15 



<400> 4 



Ser 


Asn 


Ser 


Ser 


Leu 


Glu 


Glu 


Glu 


1 








5 








Thr 


Gly 


He 


Gly 


Ala 


Gly 


Val 


Leu 








20 










Leu 


Trp 


Cvs 


Leu 


Ala 


Ala 


Gly 


Arg 






35 










40 


Phe 


Phe 


His 


Ala 


He 


Met 


Asn 


Gin 




50 










55 




Asp 


Val 


Gly 


Glu 


Leu 


Asn 


Thr 


Arg 


65 










70 






Asn 


Asp 


Gly 


lie 


Gly 


Asp 


Lys 


He 










85 








Thr 


Phe 


Leu 


Ala 


Gly 


Phe 


He 


lie 








100 










Thr 


Leu 


Val 


lie 


Leu 


Ala 


Val 


Ser 






115 










120 


Leu 


Trp 


Ala 


Lys 


Val 


Leu 


Thr 


Ser 




130 










135 




Tyr 


Ala 


Lys 


Ala 


Gly 


Ala 


Val 


Ala 


145 










150 






Thr 


Val 


He 


Ala 


Phe 


Gly 


Gly 


Gin 










165 








Lys 


Asn 


Leu 


Glu 


Glu 


Ala 


Lys 


Asn 








180 










Ala 


Ser 


He 


Ser 


He 


Gly 


He 


Ala 






195 










200 


Ala 


Leu 


Ala 


Phe 


Trp 


Tyr 


Gly 


Thr 




210 










215 




Ser 


He 


Gly 


Glu 


Val 


Leu 


Thr 


Val 


225 








230 






Phe 


Ser 


He 


Gly 


His 


Leu 


Ala 


Pro 










245 








Arg 


Gly 


Ala 


Ala 


Phe 


Glu 


He 


Phe 








260 










He 


Asp 


Ser 


Phe 


Ser 


Thr 


Lys 


Gly 






275 










280 


Asn 


Leu 


Glu 


Phe 


Lys 


Asn 


Val 


His 




290 










295 




Val 


Gin 


He 


Leu 


Lys 


Gly 


Leu 


Asn 


305 










310 






Val 


Ala 


Leu 


Val 


Gly 


Asn 


Ser 


Gly 










325 








Leu 


Met 


Gin 


Arg 


Leu 


Tyr 


Asp 


Pro 








340 










Gly 


Gin 


Asp 


He 


Arg 


Thr 


He 


Asn 






355 










360 


Gly 


Val 


Val 


Ser 


Gin 


Glu 


Pro 


Val 



Met Ala 


He 


Tyr Ala Tyr 


Tyr Tyr 


10 








15 


He Val 


Ala 


Tyr lie Gin 


Val Ser 


25 






30 




Gin He 


His 


Lys 


lie Arg 


Gin Lys 








45 




Glu lie Gly 


Trp 


Phe Asp 


Val His 






60 






Leu Thr Asp 


Asp Val Ser 


Lys lie 




75 






80 


Gly Met 


Phe 


Phe 


Gin Ser 


lie Thr 


90 








95 


Gly Phe lie 


Ser Gly Trp 


Lys Leu 


105 






110 




Pro Leu 


He 


Gly Leu Ser 


Ser Ala 








125 




Phe Thr Asn 


Lys 


Glu Leu 


Gin Ala 






140 






Glu Glu Val 


Leu 


Ala Ala 


lie Arg 




155 






160 


Gin Lys 


Glu 


Leu 


Glu Arg 


Tyr Asn 


170 








175 


Val Gly 


He 


Lys 


Lys Ala 


lie Thr 


185 






190 




Tyr Leu 


Leu 


Val 


Tyr Ala 


Ser Tyr 








205 




Ser Leu 


Val 


Leu 


Ser Asn 


Glu Tyr 






220 






Phe Phe 


Ser 


lie 


Leu Leu 


Gly Thr 




235 






240 


Asn He 


Glu 


Ala 


Phe Ala 


Asn Ala 


250 








255 


Lys lie 


lie 


Asp Asn Glu 


Pro Ser 


265 






270 




Tyr Lys 


Pro 


Asp 


Ser He 


Met Gly 








285 




Phe Asn Tyr 


Pro 


Ser Arg 


Ser Glu 






300 






Leu Lys 


Val 


Lys 


Ser Gly 


Gin Thr 




315 






320 


Cys Gly Lys 


Ser 


Thr Thr 


Val Gin 


330 








335 


Leu Glu Gly 


Val 


Val Ser 


He Asp 


345 






350 




Val Arg Tyr 


Leu Arg Glu 


He lie 








365 




Leu Phe 


Ala 


Thr 


Thr He 


Ala Glu 
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A w-i 

Asn 


lie 


Arg 


Tyr 


-JOC 








Ala 


Val 


Lys 


Glu 


Gin 


fne 


Asp 


inr 










Gin 


Lys 


bin 


Arg 






4 3d 




lie 


Leu 


Leu 


Leu 




450 






Ala 


Val 


vai 


bin 


465 








lie 


Val 


He 


Ala 


Ala 


Gly 


Pne 


Asp 








500 


Leu 


Met 


Arg 


Glu 






515 




Arg 


Gly 


Asn 


Glu 




530 






Asp 


Thr 


Asp 


Ala 


545 








He Arg 


Arg 


Ser 


Arg Arg 


Leu 


Ser 








580 


Ser 


Phe 


Trp 


Arg 






595 




Leu 


Val 


Gly 


Val 




610 






Phe 


Ala 


He 


Val 


625 








Asp 


His 


Glu 


Thr 


Leu 


Val 


Met 


Gly 








660 


Thr 


Phe 


Gly 


Lys 






675 




Val 


Phe 


Lys 


Ser 




690 






Lys Asn 


Ser 


Thr 


705 








Ser 


Val 


Lys 


Gly 


Val 


Ala 


Asn 


Leu 








740 


Gin 


Leu 


Thr 


Leu 






755 




Gly 


He 


lie 


Glu 




770 






Lys 


Gin 


Leu 


Glu 


785 








Phe Arg 


Thr 


He 


Tyr Ala 


Gin 


Ser 








820 


His 


Val 


Phe 


Gly 






835 




Ser 


Tyr 


Ala 


Ala 




850 






Leu 


Met 


Thr 


Phe 



375 

Gly Arg Glu Asp 
390 

Ala Asn Ala Tyr 
405 

Leu Val Gly Glu 

lie Ala He Ala 
440 

Asp Glu Ala Thr 
455 

Ala Ala Leu Asp 
470 

His Arg Leu Ser 
485 

Gly Gly Val He 

Lys Gly He Tyr 
520 

He Glu Pro Gly 
535 

Ser Glu Leu Thr 
550 

He Tyr Arg Ser 
565 

Met Lys Glu Ala 

lie Leu Asn Leu 
600 

Leu Cys Ala Val 
615 

Phe Ser Arg He 
630 

Lys Arg Gin Asn 
645 

Leu He Ser Phe 

Ala Gly Glu lie 
680 

Met Leu Arg Gin 
695 

Gly Ser Leu Thr 
710 

Ala Met Gly Ala 
725 

Gly Thr Gly Val 

Leu Leu Val Val 
760 

Met Lys Leu Leu 
775 

lie Ser Gly Lys 
790 

Val Ser Leu Thr 
805 

Leu Gin Val Pro 

He Thr Phe Ser 
840 

Cys Phe Arg Phe 
855 

Glu Asn Val Met 



380 



Val 


Thr 


Met 


Asp 






395 




Asp 


Phe 


lie 


Met 




410 






A TTf 


Gly 


Ala 


Gin 


425 








Arg 


Ala 


Leu 


Val 


Ser 


Al a 


Leu 


nop 








*i \J U 


Lys 


ai a 


Arg 


OX u 






475 




i nr 


Va 1 

Val 


Arg 


Asn 




4 90 






Va 1 

va _l 


ulU 


Cl n 




505 








Phe 


Lys 


Leu 


Val 


Asn 


Asn 


Ala 


Tyr 










Ser 


CjIU 


blU 


Ser 






DDD 




vai 


HIS 


Arg 


Lys 




D J U 






vai 


Asp 


r~\ ii 

oXU 


Asp 


585 








Asn 


Leu 


Ser 


Glu 


lie 


Asn 


Cj±y 


Cys 










Vdl 


uiy 


Va 1 

va jl 


r lie 






DJJ 




Cys 


Asn 


Leu 


true 




£ c. n 






Va 1 


Thr 


iyj. 


Phe 


665 








Leu 


Thr 


Lys 


Arg 


Asp 


-LJ.e 


Ser 


irp 








700 


i nr 


Arg 


Leu 


Ala 






715 




Arg 


Leu 


Al a 


Val 




730 






Tip 

ixe 


Leu 




Leu 


745 








lie 


lie 


Pro 


Leu 


Ser 


Ljj.y 




Al A 
.rvX ct 








780 


lie 


Ala 


Thr 


Glu 






795 




Arg 


Glu 


Gin 


Lys 




810 






Tyr 


Arg 


Asn 


Ala 


625 








Phe 


Thr 


Gin 


Ala 


Gly 


Ala 


Tyr 


Leu 








860 


Leu 


Val 


Phe 


Ser 



Glu 


lie 


Glu 


Lys 








400 


Lvs 


Leu 


Pro 


His 




415 




Leu 


Ser 


Glv 


Glv 




430 






Arg 


Asn 


Pro 


Lvs 


445 








Thr 


Glu 


Ser 


Glu 


Gly 


Arg 


Thr 


Thr 








480 


Ala 


Asp 


Val 


lie 






495 




Asn 


His 


Asp 


Glu 




510 








JL 11JL 


Gin 


Thr 


525 








Gly 


Ser 


Gin 


Ser 


T vc 

jjys 


o er 


XT i- >-J 


Leu 






JUU 


n n 
uin 


ASp 




ui, LI 






575 




Vdl 




Leu 


v «x 




590 






T rn 
i xp 


Pro 


Tvr 
j 


Leu 


605 








He 


Gin 


Pro 


Val 




A rrt 


A 

AO p 


Asp 








640 




Leu 


Phe 


Phe 






655 




Phe 


Gin 


Gly 


Phe 




670 






Val 


Arg 


Tvr 


Met 


685 








Phe 


Asp 


Asp 


His 


Ser 


Asp 


Ala 


Ser 








720 


Val 


Thr 


Gin 


Asn 






735 




Val 




Gly 


TrD 

■ L Jr' 
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Glu 






885 




Thr 


Gin 


Glu 


Arg Lys Phe Glu 


Ser 








900 




Pro 


Tyr 


Arg 


Asn Ser Val Arg 


Lys 



14. 



4?^ 

H. O 






430 






Leu 


Leu 


Leu 




Thr 


Ser 








445 

*± *X -J 






val 


Val 


Gin 


Val Ala Leu 


Asp 


Lys 








4 fin 






vai 


He 


Ala 


n.-Lo -rt.x. y ijcu 


C a y» 


j. hi. 






475 






480 


Gly 


Phe Asp 


/\£>p o±y vai 


lie 


Val 




490 






d Q EL 




wet 


Lys 


Glu 


Jj_yj VJXJy lie 


iyi- 


Phe 


DUO 












Gly 


Asn 


Glu 


Vol oJLU licU 


ox u 


as n 












Asn 


Pro 


Lys 


lie Leu Leu 


• 

Leu 


Asp 








04 U 






GlU 


Ser 


Glu 


Ala vai vax 


vain 


ax a 






555 








Arg 


Thr 


Thr 


lie vai xxe 


Ala 


niS 


570 






ETC 

DID 




Asp 


Val 


He 


Ala Gly rne 


Asp 


r") i7 

^ly 


585 






con 

oy u 






His 


Asp 


Glu 


Leu Met Arg 


Glu 


T rrr 

Lys 








dUo 






Thr 


Gin 


Thr 


Arg Gly Asn 


blU 


Tin 

xxe 








C O A 






Leu 


Ser Asp 


Glu Lys Ala 


Ala 


Gly 






635 






a a n 
b4 U 


Ala 


Arg 


He 


rne Arg Asn 


q 0 v- 


i nr 




650 






OJJ 




Gin 


Asn Arg 


JjeU ASp taXU 


kjXU 


i nr 


665 






D / U 






Pro 


Val 


Ser 


jrne -Leu x*ys 


vajL. 


Leu 








C PI ^ 
DO J 






Tyr 


Phe 


Val 


vai biy inr 


V ax 


^ys 






/ uu 






Pro 


Ala 


Phe 


Cqv- Tin Tl Q 

oer lie lie 


Leu 


Cpv- 






715 






Ton 


Gly 


Asp Asp 


Aia vai i»ys 


o ±11 


Oj.Il 




730 






noc. 




rne 


Leu Gly 


LcU \jJ.y val 


Leu 


Ser 


/ 4o 






/ JU 






Phe 


Thr 


Phe 


taiy Liys Ai.a 


tjxy 










/ OJ 






Met 


Ala 


Phe 


juys Aia Met 


Leu 


Arg 








/oU 






nlS 


Lys Asn 


oer ini oiy 


rvJ- ct 


Leu 






795 








Ala 


Gin 


Val 


vain vj±y AJ.a 


Tnr 


fll T7 




810 






OiJ 




Asn 


Thr 


Ala 


Aon T iD i 1 (~1 ") it 

iicu uiy 


Th r 

1 11 JL 


Gly 














1 xp 


Gin 


Leu 


Thr Lpu Leu 


Leu 


Leu 






845 






Ala 


Gly He 


Val Glu Met 


Lys 


Met 








860 






Lys 


Lys 


Glu 


Met Glu Ala 


Ala 


Gly 






n n c 
O / «J 






880 


Asn 


He 


Arg 


Thr Val Val 


Ser 


Leu 




890 






895 




Met 


Tyr Val 


Glu Lys Leu 


His 


Gly 


905 






910 






Ala 


His 


He 


Tyr Gly He 


Thr 


Phe 
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15, 

915 920 925 

Ser lie Ser Gin Ala Phe Met Tyr Phe Ser Tyr Ala Gly Cys Phe Arg 

930 935 940 

Phe Gly Ser Tyr Leu He Val Asn Gly His Met Arg Phe Lys Asp Val 
945 950 955 960 

He Leu Val Phe Ser Ala lie Val Leu Gly Ala Val Ala Leu Gly His 

965 970 975 

Ala Ser Ser Phe Ala Pro Asp Tyr Ala Lys Ala Lys Leu Ser Ala Ala 

980 985 990 

Tyr Leu Phe Ser Leu Phe Glu Arg Gin Pro Leu He Asp Ser Tyr Ser 

995 1000 1005 

Gly Glu Gly Leu Trp Pro Asp Lys Phe Glu Gly Ser Val Thr Phe Asn 

1010 1015 1020 

Glu Val Val Phe Asn Tyr Pro Thr Arg Ala Asn Val Pro Val Leu Gin 
1025 1030 1035 1040 

Glv Leu Ser Leu Glu Val Lys Lys Gly Gin Thr Leu Ala Leu Val Gly 

1045 ~ 1050 1055 

Ser Ser Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu Glu Arg Phe 

1060 1065 1070 

Tyr Asp Pro Met Ala Gly Ser Val Leu Leu Asp Gly Gin Glu Ala Lys 

1075 1080 1085 

Lys Leu Asn Val Gin Trp Leu Arg Ala Gin Leu Gly He Val Ser Gin 

1090 ~ 1095 1100 

Glu Pro He Leu Phe Asp Cys Ser He Ala Glu Asn He Ala Tyr Gly 
1105 1110 1H5 H20 

Asp Asn Ser Arg Val Val Pro His Asp Glu He Val Arg Ala Ala Lys 

1125 1130 H35 

Glu Ala Asn He His Pro Phe He Glu Thr Leu Pro Gin Lys Tyr Asn 

1140 1145 1150 

Thr Arg Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin 

1155 ' 1160 H65 

Arg He Ala He Ala Arg Ala Leu He Arg Gin Pro Arg Val Leu Leu 

1170 H75 1180 

Leu Asp Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val 
1185 1190 1195 1200 

Gin Glu Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He 

1205 1210 1215 

Ala His Arg Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val He 

1220 1225 1230 

Glu Asn Gly Lys Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala 

1235 1240 1245 

Gin Lys Gly He Tyr Phe Ser Met Val Asn He Gin Ala Gly Thr Gin 

1250 1255 1260 

Asn Leu 
1265 

<210> 7 
<211> 1207 
<212> PRT 

<213> Arabidopsis thaliana 
<300> 

<308> Genbank A42150 
<309> 1997-03-13 

<400> 7 

Glu Lys Met Met Glu Glu Val Leu Lys Tyr Ala Leu Tyr Phe Leu Val 

15 10 15 

Val Gly Ala Ala He Trp Ala Ser Ser Trp Ala Glu He Ser Cys Trp 

20 25 30 

Met Trp Ser Gly Glu Arg Gin Thr Thr Lys Met Arg He Lys Tyr Leu 
35 " 40 45 
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Glu 


Ala 


Ala 


Leu 


Asn 


Gin 


Asp 


lie 




50 










55 




Thr 


Ser 


Asp 


Val 


Val 


Phe 


Ala 


lie 


65 










70 






Asp 


Ala 


He 


Ser 


Glu 


Lys 


Leu 


Gly 










85 








Phe 


Val 


Ser 


Gly 


Phe 


He 


Val 


Gly 








100 










Leu 


Val 


Thr 


Leu 


Ala 


Val 


Val 


Pro 






115 










120 


His 


Thr 


Thr 


Thr 


Leu 


Ser 


Lys 


Leu 




130 










135 




Ser 


Gin 


Ala 


Gly 


Asn 


He 


Val 


Glu 


145 










150 






Val 


Met 


Ala 


Phe 


Val 


Gly 


Glu 


Ser 










165 








Ala 


Leu 


Lys 


He 


Ala 


Gin 


Lys 


Leu 








180 










Gly 


Met 


Gly 


Leu 


Gly 


Ala 


Thr 


Tyr 




195 










200 


Leu 


Leu 


Leu 


Trp 


Tyr 


Gly 


Gly 


Tyr 




210 










215 




Gly 


Gly 


Leu 


Ala 


He 


Ala 


Thr 


Met 


225 










230 






Ala 


Leu 


Gly 


Gin 


Ser 


Ala 


Pro 


Ser 








245 








Val 


Ala 


Ala 


Ala 


Lys 


He 


Phe 


Arg 








260 










Glu 


Arg 


Asn 


Ser 


Glu 


Ser 


Gly 


Val 




275 










280 


Val 


Glu 


Leu 


Lys 


Asn 


Val 


Asp 


Phe 




290 










295 




Lys 


He 


Leu 


Asn 


Asn 


Phe 


Cys 


Leu 


305 










310 






Ala 


Leu 


Val 


Gly 


Ser 


Ser 


Gly 


Ser 










325 








He 


Glu 


Arg 


Phe 


Tyr 


Asp 


Pro 


Asn 








340 










Gin 


Asp 


Leu 


Lys 


Thr 


Leu 


Lys 


Leu 






355 










360 


Leu 


Val 


Ser 


Gin 


Glu 


Pro 


Ala 


Leu 




370 










375 




He 


Leu 


Leu 


Gly 


Arg 


Pro 


Asp 


Ala 


385 










390 






Ala 


Arg 


Val 


Ala 


Asn 


Ala 


His 


Ser 








405 








Phe 


Asp 


Thr 


Gin 


Val 


Gly 


Glu 


Arg 








420 










Lys 


Gin 


Arg 


lie 


Ala 


He 


Ala 


Arg 




435 










440 


Leu 


Leu 


Leu 


Asp 


Glu 


Ala 


Thr 


Ser 




450 










455 




Leu 


Val 


Gin 


Glu 


Ala 


Leu 


Asp 


Arg 


465 










470 






He 


He 


Ala 


His 


Arg 


Leu 


Ser 


Thr 










485 








Val 


Leu 


Gin 


Gin 


Gly 


Ser 


Val 


Ser 








500 










Phe 


Ser 


Lys 


Gly 


Glu 


Asn 


Gly 


Val 






515 










520 


Glu 


Ala 


Ala 


His 


Glu 


Thr 


Ala 


Met 




530 










535 





16 



Gin 


Phe 


Phe 


Asp 


Thr 


GlU 


val 


Arg 








60 










Asn 


Thr 


Asp 


Ala 


Val 


Met 


Val 


bin 






75 










o n 

OU 


Asn 


Phe 


He 


His 


Tyr 


Met 


Ala 


inr 




90 










9d 




Phe 


Thr 


Ala 


Val 


Trp 


Gin 


Leu 


A±a 


105 










110 






Leu 


lie 


Ala 


Val 


lie 


Gly 


Gly 


He 










125 








Ser 


Asn 


Lys 


Ser 


Gin 


Glu 


Ser 


Leu 








140 










Gin 


Thr 


Val 


Val 


Gin 


lie 


Arg 


Val 






155 










160 


Arg 


Ala 


Ser 


Gin 


Ala 


Tyr 


Ser 


Ser 




170 










175 




Gly 


Tyr 


Lys 


Thr 


Gly 


Leu 


Ala 


Lys 


185 










190 






Phe 


Val 


Val 


Phe 


Cys 


Cys 


Tyr 


Ala 










205 








Leu 


Val 


Arg 


His 


His 


Leu 


Thr 


Asn 








220 










Phe 


Ala 


Val 


Met 


lie 


Gly 


Gly 


Leu 






235 










240 


Met 


Ala 


Ala 


Phe 


Ala 


Lys 


Ala 


Lys 




250 










255 




He 


lie 


Asp 


His 


Lys 


Pro 


Thr 


lie 


265 










270 






Glu 


Leu 


Asp 


Ser 


Val 


Thr 


Gly 


Leu 








285 








Ser 


Tyr 


Pro 


Ser 


Arg 


Pro 


Asp 


Val 








300 










Ser 


Val 


Pro 


Ala 


Gly 


Lys 


Thr 


lie 






315 










320 


Gly 


Lys 


Ser 


Thr 


Val 


Val 


Ser 


Leu 


330 










335 




Ser 


Gly 


Gin 


Val 


Leu 


Leu 


Asp 


Gly 


345 










350 






Arg 


Trp 


Leu 


Arg 


Gin 


Gin 


lie 


Gly 










365 








Phe 


Ala 


Thr 


Ser 


He 


Lys 


Glu 


Asn 








380 










Asp 


Gin 


Val 


Glu 


lie 


Glu 


Glu 


Ala 




395 










400 


Phe 


He 


He 


Lys 


Leu 


Pro 


Asp 


Gly 




410 










415 




Gly 


Leu 


Gin 


Leu 


Ser 


Gly 


Gly 


Gin 


425 










430 






Ala 


Met 


Leu 


Lys 


Asn 


Pro 


Ala 


He 










445 








Ala- 


Leu 


Asp 


Ser 


Glu 


C n V- 

ber 


LjJ.U 


Lys 








460 










Phe 


Met 


He 


Gly 


Arg 


Thr 


Thr 


Leu 






475 










480 


He 


Arg 


Lys 


Ala 


Asp 


Leu 


Val 


Ala 




490 










495 




Glu 


lie 


Gly 


Thr 


His 


Asp 


Glu 


Leu 


505 








510 






Tyr 


Ala 


Lys 


Leu 


He 


Lys 


Met 


Gin 










525 








Ser 


Asn 


Ala 


Arg 


Lys 


Ser 


Ser 


Ala 



540 
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Arg 


Pro 


Ser 


Ser Ala Arg Asn Ser 


Val 


Ser Ser 


Pro He 


Met Thr Arg 


t> A ^ 








550 






555 




560 


Asn 


Ser 


ber 


Tyr Gly Arg 


Ser Pro 


Tyr 


Ser Arg Arg Leu 


Ser Asp Phe 








565 








570 




575 


Ser 


rpV, y 

inz 


Ser 


Asp Phe Ser Leu Ser 


He 


Asp Ala 


Ser Ser 


Tyr Pro Asn 








580 






SflS 

JOJ 






590 


Tyr 


Arg 


Asn 


Glu Lys 


Leu Ala Phe 


Lys 


Asp Gin Ala Asn 


Ser Phe Trr> 


o yo 






600 






605 




Arg 


Leu 


ax a 


Lys Met Asn 


Ser Pro 




Trp Lys Tyr Ala 


Leu Leu Gly 


olU 








615 






620 




Ser 


val 


Gly 


Ser Val 


He Cys Gly 


Ser 


Leu Ser 


Ala Phe 


Phe Ala Tvr 


625 






630 






635 




640 


Val 


Leu 


Ser 


Ala Val 


Leu 


Ser Val 


Tyr 


Tyr Asn 


Pro Asp 


His Glu Tvr 








645 






650 






Met 


He 


Lys 


Gin He Asp Lys Tyr 


Cys 


Tyr Leu 


Leu He 


Gl v T.ph Se»r 






660 






O DO 






670 


Ser 


Ala 


Ala 


Leu Val 


Phe 


Asn Thr 


j_»eu 


Gin His 


Ser Phe 








675 






680 






685 




Val 


Gly 


Glu 


Asn Leu Thr Lys Arg 


Val 


Arg Glu 


Lys Met 


Leu oer .ft..La 




690 








695 






700 




Val 


Leu 


Lys 


Asn Glu Met Ala Trp 


Phe 


Asp Gin 


Glu Glu 


nail bill ocl 


705 






710 






715 




790 


Ala 


Arg 


He 


Ala Ala Arg 


Leu Ala 


Leu 


Asp Ala 


Asn Asn 


vax Axg oci 






725 








730 




/ JJ 


Ala 


He 


Gly 


Asp Arg 


He 


Ser Val 


lie 


Val Gin 


Asn Thr 


nia XacU, Flcl- 






740 






745 






i ^ n 


Leu 


Val 


Ala 


Cys Thr 


Ala 


Gly Phe 


val 


Leu Gin 


Trp Arg 








755 






760 






765 




Val 


Leu 


Val 


Ala Val 


Phe 


Pro Val 


vai 


Val Ala 


Ala Thr 


Va 1 T.fMi G1 n 

val IlCU V3J.il 




770 








775 






780 




Lys 


Met 


Phe 


Met Thr 


Gly 


Phe Ser 


oj.y 


Asp Leu 


Glu Ala 


Ala His Ala 


785 








790 






795 




800 


Lys 


Gly 


Thr 


Gin Leu Ala Gly Glu 


Axa 


He Ala 


Asn Val 


Arn Thr Val 




805 








810 




OX J 


Ala 


Ala 


Phe 


Asn Ser 


Glu Ala Lys 


l re 


Val Arg Leu Tyr 


lilX Ala 1 i 








820 














Leu 


Glu 


Pro 


Pro Leu 


Lys Arg Cys 


rne 


Trp Lys 


Gly Gin 


Tie* Ala Glv 






835 






840 






845 




Ser 


Gly 


Tyr 


Gly Val Ala 


Gin Phe 


Cys 


Leu Tyr Ala Ser 


Tvr Ala Leu 




o c n 
oou 






855 






860 




Gly 


Leu 


Trp 


Tyr Ala 


Ser 


Trp Leu 


vai 


Lys His 


Gly He 


Qpr A«tn Ph^ 


ore 

865 








870 






875 




880 


Ser 


Lys 


inr 


He Arg Val 


Phe Met 


Vai. 


Leu Met 


Val Ser 


Al a Asn Glv 






885 








890 




895 


Ala 


Ala 


Glu 


Thr Leu 


Thr 


Leu Ala 


Pro 


Asp Phe He Lys 


f^l v Gl v Gl n 








900 






ant; 

y uo 






J A. \J 


Ala 


Met 


Arg 


Ser Val 


Phe 


Glu Leu 


Leu 


Asp Arg Lys Thr 


G'\u Tie Glu 

OX U. 11C ul U 






915 






920 






925 




Pro 


Asp 


Asp 


Pro Asp 


Thr 


Thr Pro 


Val 


Pro Asp Arg Leu 


Arg Gly Glu 




930 




935 






940 




Val 


Glu 


Leu 


Lys His 


He 


Asp Phe 


Ser 


Tyr Pro 


Ser Arg 


Pro Asp lie 


945 






950 






955 




960 


Gin 


He 


Phe 


Arg Asp 


Leu 


Ser Leu 


Arg 


Ala Arg Ala Gly 


Lys Thr Leu 








965 








970 




975 


Ala 


Leu 


Val 


Gly Pro 


Ser 


Gly Cys 


Gly 


Lys Ser 


Ser Val 


lie Ser Leu 








980 






985 






990 


lie 


Gin 


Arg 


Phe Tyr 


Glu 


Pro Ser 


Ser 


Gly Arg 


Val Met 


lie Asp Gly 






995 






1000 




1005 


Lys 


Asp 


He 


Arg Lys 


Tyr Asn Leu Lys 


Ala He Arg Lys 


His He Ala 




1010 






1015 






1020 




He 


Val 


Pro 


Gin Glu 


Pro 


Cys Leu 

3 


Phe 


Gly Thr Thr He Tyr Glu Asn 


1025 






103( 




1035 


1040 
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lie Ala Tyr Gly His Glu Cys Ala Thr Glu Ala Glu lie He Gin Ala 

1045 1050 1055 

Ala Thr Leu Ala Ser Ala His Lys Phe He Ser Ala Leu Pro Glu Gly 

1060 1065 1070 

Tyr Lys Thr Tyr Val Gly Glu Arg Gly Val Gin Leu Ser Gly Gly Gin 

1075 " 1080 1085 

Lys Gin Arg He Ala He Ala Arg Ala Leu Val Arg Lys Ala Glu He 

1090 1095 1100 

Met Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu Ser Glu Arg 
1105 1110 1115 1120 

Ser Val Gin Glu Ala Leu Asp Gin Ala Cys Ser Gly Arg Thr Ser He 

1125 1130 1135 

Val Val Ala His Arg Leu Ser Thr He Arg Asn Ala His Val He Ala 

1140 1145 1150 

Val lie Asp Asp Gly Lys Val Ala Glu Gin Gly Ser His Ser His Leu 

1155 ' 1160 1165 

Leu Lys Asn His Pro Asp Gly He Tyr Ala Arg Met He Gin Leu Gin 

1170 1175 1180 

Arg Phe Thr His Thr Gin Val He Gly Met Thr Ser Gly Ser Ser Ser 
1185 1190 1195 1200 

Arg Val Lys Glu Asp Asp Ala 
1205 



<210> 8 
<211> 1161 
<212> PRT 

<213> Arabidopsis thaliana 
<300> 

<308> Genbank CAA71277 
<309> 1997-05-19 



<400> 


8 










Lys Gin 


Ala 


Ser 


His 


Arg 


Val 


Ala 


1 






5 








Leu Ser 


Val 


Ala 


He 


Leu 


Phe 


Ser 






20 










Met His 


Thr 


Gly 


Glu 


Arg 


Gin 


Ala 




35 










40 


Arg Ser 


Met 


Leu 


Ser 


Gin 


Asp 


He 


50 










55 




Thr Gly 


Glu 


Val 


lie 


Ser 


Ala 


He 


65 








70 






Asp Ala 


Leu 


Ser 


Glu 


Lys 


Val 


Gly 








85 








Phe He 


Ala 


Gly 


Phe 


Ala 


He 


Gly 






100 










Leu Val 


Thr 


Leu 


Ser 


lie 


Val 


Pro 




115 










120 


Tyr Ala 


Phe 


Val 


Ala 


lie 


Gly 


Leu 


130 










135 




lie Lys 


Ala 


Gly 


Glu 


lie 


Ala 


Glu 


145 








150 






Val Gin 


Ala 


Phe 


Thr 


Gly 


Glu 


Glu 








165 








Ala Leu 


Glu 


Asn 


Thr 


Tyr 


Lys 


Tyr 






180 










Gly Leu 


Gly 


Leu 


Gly 


Ser 


Met 


His 




195 










200 


Leu Leu 


Val 


Trp 


Phe 


Thr 


Ser 


Val 


210 










215 




Gly Gly 


Lys 


Ser 


Phe 


Thr 


Thr 


Met 



Lys 


Tyr 


Ser 


Leu Asp Phe 


Val Tyr 




10 






15 


Ser 


Trp 


Leu 


Glu Val Ala 


Cys Trp 


25 






30 




Ala 


Lys 


Met 


Arg Arg Ala 


Tyr Leu 








45 




Ser 


Leu 


Phe 


Asp Thr Glu 


Ala Ser 








60 




Thr 


Ser 


Asp 


lie Leu Val 


Val Gin 






75 




80 


Asn 


Phe 


Leu 


His Tyr lie 


Ser Arg 




90 






95 


Phe 


Thr 


Ser 


Val Trp Gin 


lie Ser 


105 






110 




Leu 


lie 


Ala 


Leu Ala Gly 


Gly He 








125 




lie Ala 


Arg 


Val Arg Lys 


Ser Tyr 








140 




Glu 


Val 


lie 


Gly Asn Val 


Arg Thr 






155 




160 


Arg Ala 


Val 


Arg Leu Tyr 


Arg Glu 




170 






175 


Gly Arg 


Lys 


Ala Gly Leu 


Thr Lys 


185 










Cys 


Val 


Leu 


Phe Leu Ser 


Trp Ala 






205 




Val 


Val 


His 


Lys Asp lie 


Ala Asp 








220 




Leu 


Asn 


Val 


Val lie Ala 


Gly Leu 
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225 










230 




Ser Leu Gly Gin Ala 


Ala 


Pro Asp 










245 






Ala 


Ala 


Ala 


Tyr 


Pro 


He 


Phe Lys 








260 








Lys 


Thr 


Ser 


Ala 


Lys 


Ser 


Gly Arg 




275 








280 


lie 


Gin 


Phe 


Lys Asp 


Ala 


Thr Phe 




290 










295 


Val 


lie 


Phe Asp Arg 


Leu 


Asn Leu 


305 










310 




Ala Leu Val Gly Gly 


Ser 


Gly Ser 










325 






lie 


Glu Arg 


Phe Tyr 


Glu 


Pro He 








340 








Asn 


Asn 


He 


Ser 


Glu 


Leu 


Asp lie 






355 








360 


Leu 


Val 


Asn 


Gin 


Glu 


Pro 


Ala Leu 




370 










375 


lie Leu Tyr Gly Lys 


Asp 


Asp Ala 


385 










390 




Ala 


Lys 


Leu 


Ser 


Glu 


Ala 


He Ser 








405 






Phe 


Glu 


Thr 


Gin 


Val 


Gly 


Glu Arg 








420 








Lys 


Gin Arg 


He 


Ala 


He 


Ser Arg 




435 








440 


Leu 


Leu 


Leu Asp 


Glu 


Ala 


Thr Ser 




450 










455 


Ser 


Val 


Gin 


Glu Ala 


Leu 


Asp Arg 


465 










470 




Val 


Val 


Ala 


His 


Arg 


Leu 


Ser Thr 










485 






Val 


Val 


His 


Glu 


Gly 


Lys 


He Val 








500 








lie 


Ser 


Asn 


Pro Asp 


Gly 


Ala Tyr 






515 








520 


Thr 


Ala 


Ser 


Leu 


Gin 


Arg 


Asn Pro 




530 










535 


Pro 


His 


Ser 


He 


Lys 


Tyr 


Ser Arg 


545 










550 


Ser Val 


Phe Cys 


Ser Glu Arg 


Glu 










565 






Ser 


Lys 


Lys 


Val 


Lys 


Val 


Thr Val 








580 








Pro Asp 


Trp 


Met 


Tyr 


Gly 


Val Cys 






595 








600 


Gly 


Ser 


Gin 


Met 


Pro 


Leu 


Phe Ala 


610 










615 


Ser 


Tyr Tyr 


Ser 


Gly 


Trp 


Asp Glu 


625 










630 




Ala 


He 


Leu 


Phe 


Cys 


Cys 


Ala Ser 










645 






lie 


Glu 


His 


He 


Cys 


Phe 


Gly Thr 








660 








Val 


Arg 


Glu 


Asn 


Met 


Phe 


Arg Ala 






675 








680 


Phe 


Asp 


Glu 


Val 


Asp 


Asn 


Thr Ser 




690 










695 


Ser Asp 


Ala 


Thr 


Leu 


Leu 


Lys Thr 


705 










710 




Leu 


Leu 


Gin 


Asn 


Leu 


Gly 


Leu Val 



19 







235 










9 A r\ 


He 


Ser 


Ala 


pne 


val 


Arg 




Lys 




250 










£OD 




Met 


He 


GlU 


Arg 


Asn 


inr 


vai 


inr 


265 










on n 
£ /U 






Lys 


Leu 


Gly 


Lys 


vai 


Asp 


\j±Y 


ni s 








285 








Ser 


Tyr 


Pro 


Ser 


Arg 


Pro 


Asp 


val 








300 










Ala 


lie 


Pro 


Ala 


Gly 


Lys 


lie 


Val 






315 










320 


Gly 


Lys 


Ser 


Thr 


Val 


lie 


Ser 


Leu 


330 










335 




Ser 


Gly 


Ala 


Val 


Leu 


Leu 


Asp 


Gly 


345 










350 






Lys 


Trp 


Leu 


Arg 


Gly 


Gin 


lie 


Gly 








365 








Phe 


Ala 


Thr 


Thr 


He 


Arg 


Glu 


Asn 








380 










Thr 


Ala 


Glu 


Glu 


lie 


Thr 


Arg 


Ala 






395 










400 


Phe 


He 


Asn 


Asn 


Leu 


Pro 


Glu 


Gly 




410 










415 




Gly 


He 


Gin 


Leu 


Ser 


Gly 


Gly 


Gin 


425 










430 






Ala 


lie 


Val 


Lys 


Asn 


Pro 


Ser 


lie 








445 








Ala 


Leu 


Asp 


Ala 


Glu 


Ser 


Glu 


Lys 








460 










Val 


Met 


Val 


Gly 


Arg 


Thr 


Thr 


Val 






475 










480 


Val 


Arg 


Asn 


Ala 


Asp 


lie 


He 


Ala 




490 










495 




Glu 


Phe 


Gly 


Asn 


His 


Glu 


Asn 


T 

Leu 


505 










510 






Ser 


Ser 


Leu 


Leu 


Arg 


Leu 


Gin 


Glu 










525 








Ser 


Leu 


Asn 


Arg 


Thr 


Leu 


Ser 


Arg 








540 










Glu 


Leu 


Ser 


Arg 


Thr 


Arg 


Ser 


Ser 






555 










C C C\ 


Thr 


Arg 


Pro 


Asp 


Gly 


Ala 


Asp 


Pro 




570 










575 




Gly 


Arg 


Leu 


Tyr 


Ser 


Met 


lie 


Arg 


585 










590 






Gly 


Thr 


lie 


Cys 


Ala 


Phe 


He 


Ala 








605 








Leu 


Gly 


Val 


Ser 


Gin 


Ala 


Leu 


Val 






620 










Thr 


Gin 


Lys 


Glu 


lie 


Lys 


Lys 


He 






635 










640 


Val 


lie 


Thr 


Leu 


lie 


Val 


Tyr 


Thr 




650 










655 




Met 


Gly 


Glu 


Arg 


Leu 


Thr 


Leu 


Arg 


665 










670 






He 


Leu 


Lys 


Asn 


Glu 


lie 


Gly 


Trp 










685 








Ser 


Met 


Leu 


Ala 


Ser 


Arg 


Leu 


Glu 








700 










He 


Val 


Val 


Asp 


Arg 


Ser 


Thr 


lie 






715 










720 


Val 


Thr 


Ser 


Phe 


He 


lie 


Ala 


Phe 
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730 






735 




Tip 

lie 


Leu 


Asn 




Aro 


Leu 


Thr 


Leu 


Val 


Val Leu Ala Thr Tyr 


Pro 


Leu 








740 








745 






750 






Vdl 


X J- 1± 


Qpr 


Gly 


His 


He 


Ser 


Glu 


Lys 


Leu Phe Met 


Gin Gly 


Tyr Gly 






/ jj 








760 






765 








Gly 


Asp 


Leu 


Asn 


Lvs 


Ala 


Tvr 


Leu 


Lvs 


Ala Asn Met 


Leu 


AT t\ 


Gly Glu 


11 c\ 










775 






780 










Ser 


val 


Ser 


Asn 


Tl A 

1 It; 


Arg 




Val 


Ala 


Ala Phe Cys Ala 




G1U 


Lys 


/ oo 










790 








795 








o c\ n 


lie 


Leu 


wlU 


Leu 


A y r 

pnc; 

DUO 


Ser 


Ar g 


Glu 


Leu 


Leu Glu Pro 
810 


Ser 


Lys 


Ser 
815 


Ser 


rue 


Arg 


Arg 


Gly 


Gin 


He 


Ala 


Glv 


Leu 


Phe Tyr Gly Val 


ope 


Gin 


Phe 














825 












Pne 


lie 




Ser 


Ser 


iyr 




Leu 


Ala 


Leu Trp Tyr 


Gly 


Ser 


Thr 


Leu 




















845 








Met 


Asp 


Lys 


Gly 


Leu 


Ala 




IT lie; 


Lys 


Ser Val Met 


Lys 


Thr 


Phe 


Met 




OCA 














860 










IT- I 

Val 


Leu 


He 


vai 


TV* v 

i nr 


Al ^ 
Aid 


Leu 


AT a 
/ILa 


Met 


Gly Glu Thr 


Leu 


TV "I _ 

Ala 


Leu 


Ala 


865 










Q7 n 








875 








880 


Pro 


Asp 


Leu 


Leu 


Lys 


uiy 


Asn 


Gin 


Mat- 
net 


Val Ala Ser 


Val 


Phe 


Glu 


He 








QQC 

OOD 








890 






895 




Leu 


Asp 


Arg 


Lys 


Thr 


Gin 


lie 


Vdl 


oi y 


Glu Thr Ser 


Glu 


Glu 


Leu 


Asn 




900 
















910 






Asn 


Val 


CalU 


biy 


inr 


Tl a 

±xe 


bill 


T.<^1 1 


Lys 


Gly Val His 


Phe 


Ser 


Tyr 


Pro 






915 








920 






925 








Ser 


Arg 


Pro 


Asp 


vai 


Vdl 


Tl p 

lie 


Phe 


Arg 


Asp Phe Asp 


Leu 


He 


Val Arg 




930 








935 






940 










Ala 


Gly 


Lys 


Ser 


Met 


TV 1 -i 

Ala 


Leu 


Val 


Gly 


Gin Ser Gly 


Ser 


Gly 


Lys 


Ser 


945 






950 








955 








960 


Ser 


Val 


lie 


Ser 


Leu 


He 


Leu 


Arg 


Phe 


Tyr Asp Pro 


Thr 


Ala 


Gly Lys 










y bo 








970 






975 




Val 


Met 


He 


blU 


Gly 


Lys 


Asp 


He 


Lys 


Lys Leu Asp 


Leu 


Lys 


Ala 


Leu 








980 






985 






990 






Arg 


Lys 


His 


He 


Gly 


Leu 


Val 


Gin 


Gin 


Glu Pro Ala 


Leu 


Phe 


Ala 


Thr 


995 










1000 




1005 






Thr 


He 


Tyr 

3 


Glu 


Asn 


He 


Leu 


Tyr 


Gly 


Asn Glu Gly Ala 


Ser 


Gin 


Ser 




10K 








1015 




1020 








Glu 


Val 


Val 


Glu 


Ser 


Ala 


Met 


Leu 


Ala 


Asn Ala His 


Ser 


Phe 


He 


Thr 


1025 








1030 






1035 








1040 


Ser 


Leu 


Pro 


Glu 


Gly 


Tyr 


Ser 


Thr 


Lys 


Val Gly Glu Arg Gly 


Val 


Gin 










104! 








1050 






1055 


Met 


Ser Gly 


Gly 


Gin Arg Gin Arg 


He 


Ala He Ala Arg Ala 


He 


Leu 








1060 








1065 




1070 




Lys 


Asn 


Pro 


Ala 


He 


Leu 


Leu 


Leu Asp 


Glu Ala Thr 


Ser 


Ala 


Leu Asp 




1075 








1080 




1085 






Val 


Glu Ser Glu Arg Val Val 


Gin 


Gin 


Ala Leu Asp Arg 


Leu 


Met Ala 




1090 








1095 




1100 








Asn Arq Thr Thr Val Val Val Ala His Arg Leu Ser 


Thr 


He 


Lys Asn 


1105 








1110 






1115 








1120 


Ala Asp Thr 


He 


Ser 


Val 


Leu 


His 


Gly Gly Lys He 


Val 


Glu 


Gin 


Gly 










1125 








1130 






1135 


Ser 


His 


Arg 


Lys 


Leu Val 


Leu 


Asn 


Lys 


Ser Gly Pro Tyr 


Phe 


Lys 


Leu 






1140 








1145 




1150 




He 


Ser 


Leu 


Gin 


Gin 


Gin 


Gin 


Gin 


Pro 













1155 1160 



<210> 9 
<211> 986 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> synthetic consensus sequence 



<400> 9 



Met 


Asp 


Glu 


Gly 


Ala 


Leu 


Ser 


Asp 


Arg 


Lys 


Lys 


Lys Val Gly Val Leu 


1 








5 










10 




15 


Phe 


Arg 


Tyr 


Ala 


Asp 


Trp 


Asp 


Lys 


Leu 


Met 


Leu 


Gly Thr Leu Ala Ala 








20 










25 






30 


lie 


He 


His 


Gly 


Ser 


Leu 


Pro 


Leu 


Met 


Met 


He 


Val Phe Gly Glu Met 






35 










40 








45 


Thr 


Asp 


Phe 


Ala 


Ser 


Lys 


Leu 


Glu 


Glu 


Glu 


Met 


Thr Arg Tyr Ala Tyr 




50 










55 










60 


Tyr 


Tyr 


Ser 


Gly 


Leu 


Gly 


Ala 


Gly 


Val 


Leu 


Val 


Ala Tyr lie Gin Val 


65 








70 










75 


80 


Ser 


Trp 


Leu 


Ala 


Ala 


Gly 


Arg 


Gin 


He 


Arg 


Lys 


He Arg Lys Phe Phe 








85 










90 




95 


His 


Ala 


He 


Leu 


Arg 


Gin 


Glu 


He 


Gly 


Trp 


Phe 


Asp lie Thr Gly Glu 








100 










105 






110 


Leu 


Asn 


Thr 


Arg 


Leu 


Thr 


Asp 


Asp 


He 


Ser 


Lys 


He Asn Asp Gly lie 






115 










120 








125 


Gly 


Asp 


Lys 


Val 


Gly 


Met 


Phe 


Phe 


Gin 


Val 


Ala 


Thr Phe Leu Ala Gly 




130 










135 










140 


Phe 


He 


Val 


Gly 


Phe 


He 


Gly 


Trp 


Lys 


Leu 


Thr 


Leu Val lie Leu Ala 


145 










150 










155 


160 


lie 


Ser 


Pro 


He 


He 


Gly 


Leu 


Ser 


Ala 


Ala 


Val 


Trp Ala Lys He Leu 










165 








170 




175 


Ser 


Phe 


Ser 


Lys 


Glu 


Leu 


Ala 


Tyr 


Ala 


Lys 


Ala 


Gly Ala Val Ala Glu 








180 










185 






190 


Glu 


Leu 


Gly 


Ala 


He 


Arg 


Thr 


Val 


He 


Ala 


Phe 


Gly Gly Gin Lys Glu 






195 










200 








205 


Leu 


Glu 


Arg 


Tyr 


Gin 


Lys 


Leu 


Glu 


Ala 


Lys 


Lys 


lie Gly He Lys Lys 




210 










215 










220 


Ala 


He 


Ser 


Ala 


He 


Ser 


Met 


Gly 


Ala 


Phe 


Leu 


Leu lie Tyr Ala Ser 


225 










230 










235 


240 


Tyr 


Ala 


Leu 


Ala 


Phe 


Trp 


Tyr 


Gly 


Ser 


Thr 


Leu 


Val lie Ser Glu Tyr 








245 










250 




255 


Thr 


He 


Gly 


Ala 


Met 


Thr 


Val 


Phe 


Phe 


Ser 


He 


Leu He Gly Ala Phe 






260 










265 






270 


Ser 


Val 


Gly 


Gin 


Ala 


Ala 


Pro 


He 


Asp 


Ala 


Phe 


Ala Asn Ala Arg Gly 






275 










280 








285 


Ala 


Ala 


Tyr 


He 


Phe 


Lys 


He 


He 


Asp 


Asn 


Pro 


Ser He Asp Ser Phe 




290 










295 










300 


Ser 


Gly 


His 


Lys 


Pro 


Asp 


He 


Lys 


Gly 


Asn 


Leu 


Glu Phe Lys Asp Val 


305 










310 










315 


320 


His 


Phe 


Ser 


Tyr 


Pro 


Ser 


Arg 


Glu 


Val 


Lys 


lie 


Leu Lys Gly Leu Asn 








325 










330 




335 


Leu 


Lys 


Val 


Ser 


Gly 


Gin 


Thr 


Val 


Ala 


Leu 


Val 


Gly Ser Gly Cys Gly 






340 










345 






350 


Lys 


Ser 


Thr 


Thr 


Val 


Gin 


Leu 


He 


Gin 


Arg 


Leu 


Tyr Asp Pro Glu Gly 




355 










360 








365 


Val 


He 


Asp 


Gly 


Gin 


Asp 


He 


Arg 


Thr 


He 


Asn 


Val Arg Tyr Leu Arg 




370 










375 










380 


Glu 


He 


He 


Gly 


Val 


Val 


Ser 


Gin 


Glu 


Pro 


Val 


Leu Phe Ala Thr Thr 


385 










390 










395 


400 


He 


Ala 


Glu 


Asn 


He 


Tyr 


Gly 


Arg 


Asp 


Val 


Thr 


Met Asp Glu He Glu 










405 










410 




415 


T », n 


Ala 


Val 


L^s 


Glu 


Ala 


Asn 


Ala 


Tyr 


Glu 


Phe 


lie Met Lys Leu Pro 






420 










425 






430 


Phe 


Asp 


Thr 


Leu 


Val 


Gly 


Glu 


Arg 


Gly 


Ala 


Gin 


Leu Ser Gly Gly Gin 






435 










440 








445 


Lys 


Gin 


Arg 


lie 


Ala 


He 


Ala 


Arg 


Ala 


Leu 


Val 


Arg Asn Pro Lys lie 




450 










455 










460 


Leu 


Leu 


Leu 


Asp 


Glu 


Ala 


Thr 


Ser 


Ala 


Leu 


Asp 


Thr Glu Ser Glu Ala 
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Val 


Val 


Gin 


Ala 


Ala 


His 


Arg 


Leu 








500 


Glu 


Asp 


Gly 


Val 






515 




Gly 


Val 


Tyr 


Phe 




530 






Met 


Lys 


Ser 


Leu 


545 








Val 


Ser 


Phe 


Trp 


Val 


Gly 


Thr 


Val 






580 


He 


Leu 


Ser 


He 






595 




Ala 


Val 


Val 


Pro 




610 






Leu 


Gly 


Asn 


Ala 










Glu 


Ala 


He 


Glu 


Glu 


Ser 


Met 


Tyr 








660 


Tyr 


Gly 


He 


Thr 






675 




Gly 


Cys 


Phe 


Arg 




690 






He 


Leu 


Val 


Phe 


705 








Ser 


Ser 


Phe 


Ala 


Phe 


Leu 


He 


Glu 








740 


Leu 


Glu 


Gly 


Val 






755 




Val 


Leu 


Gin 


Gly 




770 






Leu 


Val 


Gly 


Ser 


785 








Glu 


Arg 


Phe 


Tyr 


Glu 


He 


Lys 


Lys 








820 


Val 


Ser 


Gin 


Glu 






835 




Ala 


Tyr 


Gly 


Asp 




850 






Ala 


Asn 


He 


His 


865 








Gly 


Asp 


Lys 


Gly 


Ala 


He 


Ala 


Arg 








900 


Ala 


Thr 


Ser 


Ala 






915 




Leu 


Asp 


Lys 


Ala 




930 






Leu 


Ser 


Thr 


He 


945 








Val 


Lys 


Glu 


His 



470 



Leu Asp 


Lys 


Ala 


485 








Ser 


Thr 


Val 


Arg 


lie 


Val 


Glu 


Gly 








520 


Lys 


Leu 


Val 


Met 






535 




Arg 


Ser 


Ser 


Gin 




550 






Arg Val 


Leu 


Lys 


565 








Cys Ala 


He 


lie 


He Ala 


Val 


Phe 








600 


He 


He 


Val 


Val 






615 




Arg Asp 


Lys 


Lys 




630 






Asn 


He 


Arg 


Thr 


645 








Leu 


Pro 


Tyr 


Arg 


Phe 


Ser 


He 


Ser 








680 


Phe Gly 


Ala 


Tyr 






695 




Ser 


Ala 


He 


Val 




710 






Pro 


Asp 


Tyr 


Ala 


725 








Arg 


Pro 


He 


Asp 


Phe 


Val 


Phe 


Asn 








760 


Leu 


Ser 


Leu 


Glu 






775 




Ser 


Gly 


Cys 


Gly 




790 






Asp 


Pro 


Leu 


Ala 


805 








Leu 


Asn 


Val 


Gin 


Pro 


lie 


Leu 


Phe 








840 


Asn 


Ser 


Arg 


Val 






855 




Phe 


lie 


Glu 


Thr 




870 






Thr 


Gin 


Leu 


Ser 


885 








Ala 


Leu 


lie 


Arg 


Leu Asp 


Thr 


Glu 








920 


Arg 


Glu 


Gly 


Arg 






935 




Gin 


Asn 


Ala 


Asp 




950 






Gly Thr 


His 


Gin 



22. 







475 




Arg 


Gly 


Arg 


Thr 




490 






Asn 


Ala 


Asp 


Val 


505 








Ser 


His 


Asp 


Glu 


Gin 


Thr 


Gly 


He 








540 


Asp 


Arg 


Asp 


Asp 






555 




Leu 


Asn 


Thr 


Glu 




570 






Asn 


Gly 


Leu 


Gin 


585 








Asn 


Asp 


Val 


Lvs 

2 


Ala 


Gly 


He 


Val 








620 


Leu 


Glu 


Ala 


Gly 






635 




Val 


Val 


Ser 


Leu 




650 






Asn 


Ser 


Val 


Arg 


665 








Gin 


Ala 


Met 


Tyr 


Leu 


Val 


His 


Gly 








700 


Leu 


Gly 


Ala 


Val 




715 




Lys 


Ala 


Lys 


Leu 




730 






Ser 


Tyr 


Ser 


Glu 


745 








Tyr 


Pro 


Thr 


Arg 


Val 


Lys 


Lys 


Gly 








780 


Lys 


Ser 


Thr 


Val 




795 




Gly 


Lys 


Val 


Leu 




810 






Trp 


Leu 


Arg 


Ala 


825 








Asp 


Cys 


Ser 


lie 


Ser 


Asp 


Glu 


lie 








860 


Leu 


Pro 


Asp 


Lys 






875 




Gly 


Gly 


Gin 


Lys 




890 






Gin 


Pro 


lie 


Leu 


905 








Ser 


Glu 


Lys 


Val 


Thr 


Cys 


He 


Val 








940 


Leu 


He 


Val 


Val 






955 




Gin 


Leu 


Leu 


Ala 



480 



Thr 


He 


val 


T 1 Q 

lie 






495 




He 


Ala 


Gly 


Fne 




510 






Leu 


Met 


Lys 


Lys 


525 








Asn 


Ser 


Glu 


Ala 


Leu 


Glu 


Val 


Pro 








560 


Trp 


Pro 


Tyr 


Val 






575 




Pro 


Phe 


Ala 


lie 




590 






Leu 


Leu 


Leu 


Leu 


605 








Glu 


Met 


Lys 


Leu 


Lys 


lie 


Ala 


Thr 






640 


Thr 


Glu 


Lys 


Phe 






655 




Lys 


Ala 


His 


lie 


670 






Phe 


Ser 


Tyr 


Ala 


685 








Leu 


Met 


Phe 


Val 


Ala 


Leu 


Gly 


Thr 








720 


Ser 


Ala 


Ala 


Leu 






735 




Gly 


Leu 


Pro 


Asp 




750 






Pro 


Asp 


Val 


Pro 


765 








Gin 


Thr 


Leu 


Ala 


Val 


Gin 


Leu 


Leu 








800 


Leu 


Asp 


Gly 


Lys 






815 




His 


Leu 


Gly 


He 




o o r> 






Ala 


Glu 


Asn 


He 


845 








Val 


Ala 


Ala 


Lys 


Tyr 


Thr 


Arg 


Val 








OOfl 

OOU 


Gin 


Arg 


lie 


Ala 






895 




Leu 


Leu 


Asp 


Glu 




910 






Val 


Gin 


Glu 


Ala 


925 








lie 


Ala 


His 


Arg 


lie 


Asn 


Gly 


Lys 








960 


Gin 


Lys 


Gly 


lie 
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965 970 975 

Tyr Phe Ser Met Val Val Gin Ala Gly Thr 
980 985 

<210> 10 
<211> 14000 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature 
<222> (3429) ... (0) 
<223> Translation start codon 

<300> 

<308> Genbank AP000386 
<309> 1999-08-03 

<400> 10 

caattaatta tatcaaaatt tcgtatatca tttaaaattc atcactgatt tttgtttaga 60 

aaaaaagata gatagctatg gacatgacgt cgaattttaa tatatcctat gtaacagtgt 120 

tcatataatc aaaaaagaaa aaataattac tattgtttgt gttctttgca acaatgcgtt 180 

agaaactcgg agaccacttg ttttttctct tatgatttcg cttcagtatg atttagcaaa 240 

gcatctctac ggtacaaaat atcttacgaa ttgacatgat ctgatccaca ctatatcaag 300 

gagttagaaa gacagaactg aaagcttctt agctcgatat gtatttcaag ctacggttca 360 

caccatctag ggagacaaga atggaagaag gcatgacgtc aagtcgaact atgcagctct 420 

cttacttaaa tgggtcgaca aacaaatacg gaaccaacta tcatccattc gagcaagcgg 4 80 

tgataggtgg tatgactaag gtctctaaat ttggtttacg gcgaaaaatt aattgatcaa 540 

aaaatttttg gtttagtgtt ttaagatctt tttcaaaaaa acgttgtatt tgatgtataa 600 

acgctatttc tttttgaaat taatttaata ttatttttgt ttgaaaagaa gaagaaacat 660 

aattcaaaca atccctatct ttaatgttcc aacatcttaa taaatacata attaaaactc 720 

tacgatcaat actacacttc tgacgattat taaatcgcat cgtcgtacac tattattaca 780 

aagaccgttc tatctacatt aatattcttt atatattttt ttttttcatt ttttgagatt 840 

gttggagtat atatccttag attttcttgt atataaaaat agataattat aatgggataa 900 

acgaaacatc gttgaagccg tgaagtggca ttggtctaca ccagagcaac acaaaaagac 960 

aaccacttca catggtctgc tcttacattc ataaccgaat ctaagtcact tttagttggt 1020 

tgtaattttg taactattat tccaacacct ctttttttac tttttagatt gttttataat 1080 

agaaatattt taattcctaa attaataatg aaagtaaatg taatatgagt cagtacaata 1140 

tgtgaaaaac ttaaaaagtt gacagaattt agcatttgat taaaagtgta tgaagaagaa 1200 

gaaaaaagaa gatctttttg tatctataga tttagtgcat aacttttctc agattttcga 1260 

tatatacaag aatttaacat aagaaaaatc aagacaaatg gacctggtta taatcggtta 1320 

tctgttgtaa atattatatt tcatattctt ctccacttca taattcttat tggagttcct 1380 

tcaccaaatg tttgatgttc cattaaatta atctaccact ctaataagag gtatcgtact 1440 

acaaattaca cttcataaac aagagaagaa cataaatttg aattttttta aaaaaacata 1500 

tgcgttataa caccaaacag taacggacta gctgatcctt gaatttatat tagttgcaaa 1560 

aatttatata tcgaaattga aacatgaatt tttaaaatta ttagaaaatg tatgatgttg 1620 

tctaaatgtg acattacaaa tacatgatgt tgtttaaaaa ttattataaa acaaccaaag 1680 

tttggcgtca ttctggtaaa cgcacttaat gaatttatat acagttaaag attttaacat 1740 

caaattttaa aatgacaact taactaaaat ttgtatccta atatttttac tagagaaact 1800 

cacacatatt tttcaaacaa atgattagtt atatatcctt cgataatagg tattgtgtaa 1860 

aactgtgtgg tttgcacaag tgatccctcg atattctttc tgctaaagat cgacttccca 1920 

cagtttcgat atctcgggtt tgggtgcaat agcatatgct tgtttagtat gcagataatc 1980 

gtatgagaga gtcagagaga tcatctattt ttcatgatat ggtctgagct aatcgatcat 2040 

ttgttattta tactcgataa cgttctaatt tgtatgattt ttcagccttg atctatcaca 2100 

aaatggagat aatgaaatgg tagtcataag aaaggtaatg atccccttga catgcttatt 2160 

ataacacaaa aagtaatcgc tactgattag cttaccctat gatttgaatc atataacttt 2220 

atactaattg tctaagtgtg atgatataat gtatatgaty tcatttataa cttattagta 2280 

aaaatataaa gttcttcatc attgtcataa ggagttaatt ggatatacat caaaaaattt 2340 

cctaaatttt tagaaattat taatcaagtt ataatagaaa ttattaaata aaatatatga 24 00 

tgttttctaa atgtgacgat acaaaataca tgatgttgtt caaaacttat tagacagaaa 2460 

ccaaagtaca tcatcattgt gatgaagact ttaatggatt tatacatttc aaagttttaa 2520 

acattactta tttaaatgaa aaattgagct aaaatttgta tacttatatg ttgaccatag 2580 
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aaatacttct atataatttt agacaagttg tctatacata tatatatata tatatatata 2640 

tatatatatg tatttacaca aaaatcaagt tttcttaatg taacaatata aaaatataaa 2700 

atatctaatg ttgtttaaaa attatttaga ccaaaaaaaa aaaaaactgt aatcaaactt 2760 

tcattggatt atatctaact ctaaacatga atttttatac aatgacatgt tcaataaaat 2820 

tcccatttca tagtttgacc agaaaaaatg cttgcagatt tgttaaaatc tctacttatg 2880 

caaaaccatt gaatcacacc agattttgat ttgtagtact ttaaaaatat taattgaaca 2940 

caacataact ttaacaaaaa agatctttat aaaaaaaaaa ctttctaaaa tgacattaat 3000 

taacacaaat tagtcaattc aaacaatcca ataaaatcaa agtaaaccct taattgtttt 3060 

ttttttccat tttacaaaat gttaaacatt aattacatca actagcaaca cgaatctctc 3120 

aatctctcat gtctcaatga gtctttgacc actagtcttc atcactctct ctccctccta 3180 

tatataccaa tcacaccctt ctccaaccac actctcattc tctctctctt ctcaccactt 3240 

caaaacaaga cactcgtaaa agagttcttg aacttcacaa aacaattgtc agattttcaa 3300 

gaaaaacttt ataaaacaaa aaacatttca ttctttctct ctctctctct cactgctcaa 3360 

tgatctcgtt ttctcactaa accaactcgt ttcttcttac tttctttaac tcggatctac 3420 

aaaaaaccat gtcggaaact aacacaaccg atgccaagac tgttccagca gaagcagaga 34 80 

agaagaaaga acagagttta ccattcttta aactcttttc tttcgctgat aaatttgatt 3540 

atctcttaat gttcgttggt tctcttggtg ccattgttca tggctcttcc atgcctgtct 3600 

tctttttact ctttggtcaa atggttaatg gatttggtaa aaaccaaatg gatttacatc 3660 

aaatggttca tgaagtctct agagtaagtc ttttttttct ttcttgtttt tatcaaatta 3720 

gatccaagat ctgatctaat ttttgtgttt gtgaaatttt gcagtattct ctatatttcg 3780 

tctacttggg tttggtcgtt tgcttctctt cttacgcagg tttgtttctt acaaatccaa 3840 

tcttcttttt ctttattctt caatcaagaa acccagtaat taaatcacat tataaacaag 3900 

atcaagaaat tttttgcaca aaaaaaaaag aaaacaagat caagaaatta ataaagttta 3960 

gattaaacat ttgcattatt ttcttttttt tggttcaacc aattaaccca ttaacataaa 4020 

ttaaatcaga ttattaggca agaaatctaa aggtgtattt tttgagtttt atttgattct 4080 

tcctcaaagt gaccatttac tctgaaggta aaacattttt ttttactcta caacaaaaac 414 0 

ccaacttctt tttttgtttg cttcaaaatt atattaaaaa aagaaagctt aaactttgtt 4200 

taaaatcttg ttttttttgt cattcgcttt tgattagaac taaaaaaacc atttttatag 4260 

aatgttgttt acataagtga tgtaatgggg ttcggaacga actttccgtg ccaaccgctt 4320 

ttatagacaa gaaatctaaa ggtgtaattt ttgagttttt atttgattct tccacaaagt 4380 

gatcatttac tctggtaaac atttttttac tctacaacaa aaacccaact tttttatgtt 4440 

tgcttcaata aataaaaacc tagaaacaga aatcacaatc tagagagaat caaaattata 4500 

tataaaaaaa aaacacttca aatttttaaa aattttaata tgttaaaagg ataagccaag 4560 

tccacgtgat tcatggacta ctactttgtc tatatcgaaa aaaaaaaaaa aatgggcatc 4 620 

tctctcacat ttattacaac ttcataaaaa ttcttgtaat aataaccata attttttgtt 4680 

aaataatttt acagagatag catgttggat gtattctgga gaaagacaag tagcagcatt 4740 

aaggaagaaa tatcttgaag cagtattaaa acaagacgtt gggttctttg atactgatgc 4800 

tagaactggt gacattgtct ttagtgtttc tactgatact cttcttgttc aagatgccat 4860 

tagtgaaaag gtaaccattt ttttttttac taccaacaaa atctagtatt taccggcggg 4920 

tcccattttt ttcatgatca atctttatca aagttttggg tccgtttatg ttagcaaccg 4 98 0 

tcgtttatct tgcgcctcag gtggttaggg aagaaaaaag actatagatt gatttttttt 5040 

tttgttaaaa taaaagttgg atttttttca taagattgtg aaaataagta attttttttt 5100 

acaaatttgc tatagaaata gaggattttt ttcactattt aactaaaact taaatgatat 5160 

agtactcatt ttcttacatt tttaatataa aaatagcttc tttttttcca ttgtgaaaaa 5220 

tatttattat gcaaaataat gtttctttaa caagtaaaat aaatatttgt tattaaaaaa 5280 

aatgtgtaaa gtaccttcaa attttaatag tacattttat ctaaaactgt tttttttttt 5340 

gcaatttggt cctgtctctg tggtgcacgt tagcaagcac gagctaacat gcactgggtc 5400 

ccctagtttc attattatca tcactctttt ttattgttga tgaagtgaat taataacaga 5460 

gtagttgaaa aaaaattgaa tttctcaggt tggaaacttt atacattacc tctcaacatt 5520 

tttggcggga ttagtagttg gatttgtatc agcatggaaa ttagctttgt taagtgttgc 5580 

tgtgattccc ggaatcgctt tcgccggagg tttatacgct tatacactca ccggaattac 5640 

ttcaaagagc cgtgaatctt atgctaacgc cggtgttatc gccgagcagg taaataaaga 5700 

ttaaagtttg ttcctttatt tgttttattt aatgcccacc aactgttcga cgtaatgctt 5760 

tttaactgtt cttaggcttg cttctttgta cttgatctct caaatgattt tttttttttt 5820 

tcattgcttc ttatgcaatc caaagactta aacagtgttt tcactaattc aaaatctttg 5880 

cttttctttg gtcggttaat gattgatgtt ttggtttctc aggcaattgc tcaagttcga 5940 

actgtttatt cttatgttgg agagagtaag gcacttaatg cgtattcgga fcgcgattcag 6000 

tatacgctta agctcggtta taaagcgggg atggctaaag ggttgggttt aggatgtact 6060 

tatggaatag cttgtatgtc atgggctttg gtgttttggt atgctggagt ttttattcgg 6120 

aatggacaaa ccgatggagg aaaggcgttt actgctatat tctctgctat tgttggtgga 6180 

atgtaagaat tcaaaacata acaatgatgt ggttttgtgt attttgtgtt ttgttattac 6240 

ttggatgtca ctttttgttt ctgtgtgtgt ttttaggagt ttggggcaat ctttctcgaa 6300 
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tcttggggcg 
gagaccgacg 
cattgagttt 
gaactttaat 
tggaaagagt 
gattgaattt 
attttttctt 
ttgaagtttt 
actatactag 
gctgcttccg 
caggtataaa 
ttttgttgaa 
gaagcagaga 
tgaagctaca 
agtcatggtg 
tgattccatt 
cattgccaaa 
agatttctca 
aacgaaatca 
tggagctgat 
ccctgaaaat 
catgggagca 
gagcaacatg 
agagtatgtc 
acattacttc 
ctcaggtatg 
ggtttcttga 
atgaacacaa 
ccgctatagc 
tcatagtcgc 
ttctagtcct 
aatattcatc 
aaaaacgaca 
ttatagaaca 
aagtttgcat 
ctactttttg 
taaaaaaatt 
attatatagt 
gattcctctg 
actctctgaa 
aacctcacca 
atttagagtt 
tgtgtgtcag 
tggtagatta 
tctgctttga 
tatcaaatag 
gggatagagt 
cttagacctt 
tctaaatcat 
gctggagaca 
aacattagaa 
gagcttcgtg 
ggcctctcgc 
cttgtgagta 
attactgcaa 
gaagctgttg 
gctgatgctg 
gcttaccctt 
ggacatagcc 
atcgagcggt 
cggctaaacc 
ttcgcagcaa 



tttagtaaag 
ataattcaag 
aaagatgtga 
attttcttcc 
actgttgttt 
gttttttgct 
gatgagcagg 
tgcgtgaaca 
agaacatact 
ctgcgaatgc 
tcaaatattt 
tgtcaataca 
attgcaattg 
agcgctcttg 
gggaggacca 
gccgtgatac 
tccggtgctt 
aacccgtcaa 
ctcagtttaa 
ggtcggatag 
tacttctaca 
gtaggctcaa 
atcgaagtct 
ttcatctaca 
tttagcatca 
tatcaaaatc 
ttgttcttat 
ctcaagcctg 
cgagagaatc 
cttcatagta 
cgctaacttt 
aatcattaaa 
aaaggccaga 
tggcaacaca 
ttatgtttac 
gaaatagtgt 
aaaaaacagt 
ttctattggt 
tcttttgctt 
ctctgaactg 
caaaaatcca 
cttcattggt 
tgtttttaca 
ggttgtgatg 
tatatcacca 
atttcacaca 
gttgcatatg 
tgattgtcaa 
aagtattaat 
cagctaaggc 
ccgtagcagc 
tacctcagaa 
agcttgctct 
aaggcgtgtc 
actctgttgc 
gttcggtttt 
atcccgtgga 
caagacccga 
aagctcttgt 
tttacgaccc 
tgaaatctct 
cgatcttcga 



gtaaagcggc 
acccgttgga 
cttttagcta 
cttctgggaa 
ccctcattga 
atttgagtct 
gcaaattctg 
aatcgggctt 
ctatggaaag 
gcatagtttc 
gattatatgt 
attaggttgg 
ctagggcgat 
atgctagctc 
ctg.ttgttgt 
agcaaggcca 
acgcatccct 
ctcgtcgcac 
gatcaggaag 
agatgatttc 
ggcttctcaa 
ttctttctgg 
tctactacac 
ttggtgctgg 
tgggagaaaa 
tcctgaattt 
ttagctatct 
atcgctgcac 
tcagtaattc 
gaatggagag 
gctcaggtaa 
atataattgg 
tttttataaa 
gccactgcct 
aataataatc 
aagggtcaag 
aggctttagt 
actatgaccc 
ctgctaaatc 
tgatcccact 
aaaaatctgc 
caaagttgtt 
ccatccacta 
agtttttttt 
acattttttc 
gaagaacata 
tgatttaact 
tcttattttt 
atatgtggct 
tcatgcaaag 
tttcaatgca 
aagaagctta 
ctatggttct 
aaccttttcc 
tgaaactgtc 
ctcggtcttg 
gacgatccgt 
cgtcatggtt 
gggcgcgagt 
tcttgctgga 
aaggctcaaa 
caacatcgcc 



tggttataag 
tggaaaatgt 
tccttcacgg 
aactgtggcg 
gagattctat 
ttggctagtg 
ttggatggtg 
gtgaatcaag 
cctgatgcaa 
attacattac 
ctagttaatg 
agaacgtggt 
gttgaaagac 
tgagagcatt 
tgctcatcgt 
agttgttgaa 
catcaggttt 
tcgttcaacc 
tttgaggaat 
aaatgcagag 
gcttaattca 
tttcattggt 
agactatgat 
tctctatgct 
cctcacaaca 
gcttaaaatc 
tgagaaacga 
gtttagctac 
tacaaaacat 
tctcacttct 
ataatctaat 
aatcttgcat 
tttatatttg 
ctacacgtgc 
ataaaagaat 
atcatggagc 
ttactctctg 
atagataaca 
tatcctactt 
tctctcttta 
aatttttttc 
gtctcagtgc 
gtcaatgttt 
ttgtttctaa 
acccgtgatc 
gataggcatt 
atatggttct 
tacaagtgaa 
acattgcagc 
acttcaatga 
cagagcaaga 
taccgaagtc 
gaggctttaa 
aaagtgatca 
agtcttgctc 
gacaggcaga 
ggagacattg 
ttcagggact 
gggtcaggga 
aaagtcatga 
atcggtcttg 
tatggtaaag 



ttgatggaga 
ttggatcaag 
cctgatgtta 
gttgttggtg 
gatccaaaca 
actggatcat 
ttgagataaa 
aacctgcgct 
caatggttga 
ttcctaaagg 
tcatggcctt 
gttcaactct 
ccaaagattc 
gttcaggaag 
ctctgcacca 
accggaacac 
caggaaatgg 
cgtttgagcc 
ctgagctatt 
actgaccgaa 
ccggaatggc 
cctacatttg 
tcaatggaaa 
gtgggtgctt 
agagtaagaa 
actttcccat 
agttggttgg 
tgatgcagca 
gacttcactt 
catcttaggc 
ctttttactc 
tcaccattat 
cttttcaaaa 
ttcatcttct 
tattaagaag 
cctcacatca 
ggcatgtgtg 
gtgttcacga 
taagctttca 
tttaattctt 
cttcttagaa 
attatttact 
gcttgtgggt 
ctagctgcaa 
taattattag 
gtacttgtac 
acgtcatgtt 
ctattattac 
aactatctct 
ttgctggtga 
ttctctcttt 
aaacctcggg 
ttctctggta 
aagtgtttgt 
ctgaaattat 
ccaggattga 
agtttaggca 
ttaacctcag 
agagttctgt 
ttgatggcaa 
ttcaacaaga 
atggtgcaac 



taattaacca 
ttcatgggaa 
tgatcttcag 
ggagtggctc 
gcggtaattt 
aactttgttt 
gacgcttcag 
ctttgccact 
agttgaagct 
ctacgacaca 
ttgcttaatt 
caggtggaca 
tgttactaga 
ctttagacag 
tcagaaatgt 
atgaagaact 
ttggtactcg 
attcactgtc 
cttacagcac 
agactcgtgc 
cttactcaat 
ctattgtgat 
ggaaaacaaa 
atttgatcca 
gaatgatgct 
ttcttatttt 
ttcgatgagg 
gatgttaaat 
ctcacatcct 
acattcccac 
aaaatctttc 
gatcttaaga 
gttcaaaact 
aactttatcc 
cttttttttt 
ataaatgtgc 
aagaatattt 
aaatagctaa 
tatttacttc 
ttgcccataa 
ccaatatttt 
tatcacagtg 
ttctttgttt 
ggttcaggac 
ttgaaaaatc 
tgatgttgat 
ttagtggcga 
catctgttgt 
gaagggtttt 
aggagtcagt 
gttctgtcat 
tttcctattt 
tggtgcccac 
ggttttggtc 
tcggggaggt 
cccggatgat 
tgttgatttc 
aattcgagct 
aattgcgatg 
agacatccgc 
accagctctc 
tgaatccgag 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
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gtaattgatg cagctcgagc cgcaaatgct cacggtttca tcagtggttt acctgaaggt 10080 

tacaaaactc cagtaggcga aagaggagtg cagttatcag gtggacagaa acagaggatc 10140 

gcgatagcaa gagctgtgct caagaaccct acagtgttgc ttctagacga agcaactagc 10200 

gcactagatg cagaatcaga atgcgtgctg caagaggcgt tagagaggct catgagaggt 10260 

cggaccaccg tggtagttgc tcaccgcttg tccaccataa gaggtgttga ttgcattggt 10320 

gtgattcaag acgggcggat tgtggagcaa ggcagccatt cagagctcgt tagccgacca 10380 

gagggagctt attcaaggct gttacagctt caaacacata ggatttgaag cttgatcatg 10440 

gattaaaaac aaaaaatcgg tttgtgtaat tttttttata ttaaaacttt aatttggaag 10500 

atttctatgg actataacga taatatgaat aggtgtagat aatgaagctt ttggagtgtt 10560 

tatgaaggtt ctttaattaa gggttatttt tttcgcattt tgcttatgtg cccgttttgg 10620 

aacataagac tgaactatgt tttcgttatg tttttaattt atgcctcgaa acaaaacaaa 10680 

atcactctaa tactttggtt gatcaaaatt tccctaaaac atttggttca tgactagaat 10740 

tgatcggaca tgttctttta ggggtttgat ctcgcatggt ctaatcaggt tatgtggtct 10800 

tacagcgact taatcaattt agaccggcta tcaaaattag attcagccta aacagcaaaa 10860 

ttgtcttata taatatccta aatcacagac cggatttttt ttttttataa actctatcgg 10920 

ctgatccggt caaccccggt agaaacgcac tgaagtgttt catagtggac ttcaaaccga 10980 

aattttgcag aaattgagat gagaatgagc tctacctcgc attgacatgt tatttcgttc 11040 

ttgagatctt aagtaatttt ttgagatctt aagtaatttt ttggtattca agttccttat 11100 

ttttttgcgg ttactcgtct tgacatgtta tttcgttctt gtgctactta cataccacaa 11160 

tatatgtagg tttctaaaca tatataatag gaatgttgtt ctatatatgt gttgtttgat 11220 

acgtaaagag taatctggtt ctacgtagag tctatctctt gtttcttggt gagattttga 11280 

ctggactgca tggtaatcct cgcgcatggt gtttagttac cggttgcttc attcatttgg 11340 

ccacgctcaa attttctaga agactcttcg gtctttggta caaaggcctc cgatttttcc 114 00 

atgacaagtt cattcatatt tttttgtgac aacttgattg ttgttcattt ccgaatattc 11460 

catgaacgta tcaaactcaa taaaatcgat tgttgtcact ccctttaaga tgagatcctc 11520 

aacggcagct tcaccactag ccatgttttc acacatgatc aactatctca catttttact 11580 

aaggctttaa gcaggaatat tacttctaga attttaacaa tctagaattt ctagaggaaa 11640 

tgttcacatg tatatatttt ggtatatgat ctagagaact tatataagat tatcaatgtt 11700 

taatatactc aacaaatggg ttcttctatt ttatcttcta attgttgggg ccttgaatta 11760 

tttagttctt tttcactatt ccttttcttt tatttggtgt gtggagccat aaagttccaa 11820 

taatttctat gaaaagaaat gaataaaatc taaagaaacc aactgtgaag ctctttaatc 11880 

ttattctatc tgttggatct gtttgccttt atttgctttg catgtggttt tgctgtttaa 1194 0 

gataaaggcc atatcttctc ctaattcttt ttccaaatca acacatcaaa tatataaaag 12000 

actccaaata ttttgaatag aatcataatt tatttaatta caaaaaaact taaagaattc 12060 

aagacctgcg atcacaagac aaatacatgt aaggaattaa atatttaaga taccaaccgc 12120 

taataaataa gactcacaat ttctaaaaat taaaacgaaa aaaagaaaaa attagtcatc 12180 

attgcatatc ttagtgaaca aaaatcaata agaaaaggat aatatacttt ttaccaacat 12240 

cacataaacc taaaagcatc tcacaccaaa agtaaattaa aaactcagaa aaagctgaat 12300 

acaagttatg aattttatgg aaagtgttag tgtgttagtg ttgttatcat tacaacttta 12360 

aaataattat gaacatgaaa aaatagattg aacatgaagc agcattggga aggactcgaa 12420 

tgtccctact catactgaca aaacctcaag tgttccagtg cgttaggttt cactttccag 12480 

tagtatgtgt atctgctttt tttgtatata ttaatgtaat agatagtgta gagaatcatg 1254 0 

aggaacattt tgtaatcata gaactaaggc ataattttga tcgtgtgaaa taatactatt 12600 

atttttggta acaaggatga tctggaccga attccgacta attttccagg tgaatatagt 12660 

catcaagtaa cttgatcacc ggattttaaa aatgaattat gaacatgata tcatttcaac 12720 

taatcaaata aaaatgaaaa aattcatgtg ggaatcatat ccctcaaatc tgttaccacc 12780 

aaactacttt catttggttt gtgtgagaaa tataagaatt atcaaacaaa gaaaaaggtt 12840 

ttgcttaatt ataatgatat gaggttcatc aaatttatat accactaaaa tattcctaca 12900 

ctcatccaaa tattattctc gcttatgcag ttttaatcta atgtacaaat catatccata 12960 

agcgcattgg ccgatgctaa aattttggca ttagttgttg atttgttttt cttgagtcct 13020 

atgttcaaaa aatatatttg ataatttaat gtataaatta tacccataat cacattgatc 13080 

aatgcttaaa gttttgtatt agttgttgat tttttgtttg tttgcgtgtg ttgaattttc 13140 

ataaatgaaa agtaaatacg ataatcatat ttcaaaacgt aaaggtgtta attagtagtc 13200 

taaatggtaa aatataaaca gtatattata aaaattacta aaatggtttt gtaaaaaaaa 13260 

tatgctattt gtattataat gaaattcaaa aattttaaat agaacgtata atttctgcac 13320 

aaagaggttt tgaggtgtta taattcatga agtaaatttt atttactgac gggtaagttg 13380 

tgaaaaagtt tgaagactat tttttttgct tttcacagag aaaactactt tcctttttaa 13440 

ttgtatgatg agaaggcaaa agtgcagaca tgtgctttct tttctcccat tttcaacaat 13500 

gtcactcgtt gtattattca tattttagca aactggttat atctatatct atcaatcatt 13560 

tcagaacatc atatccatca gtttttggac attgctacat acgttagtat tgatgtacca 13620 

gttaccctaa caggcttttg catagtgtgg cagaacacgt gaggtgtgat atatgcggat 1368 0 

gaattctatg ttctgcattt tgttaccatt catataaaag tattgtttta gttgtgctgc 13740 
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ggtttaagtc ctaatacaga tttatttgta gtgttatgtt tataaaccac attggaaccc 13800 

aaactctaag agaatataag atcaatgtgt aattaataaa ttttatagtc attcggataa 13860 

aacttagaaa gataacaaaa gtaagaacga gtatttttaa gcgaatactc tttagatatt 13920 

ctttgtattg atcattattg acaaaccatt aaacattttt gtagacccta aaatccgata 13980 

tccaaaaaac aattcttatt 14000 



<210> 11 
<211> 29 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 11 

cactgctcaa tgatctcgtt ttctcacta 29 

<210> 12 
<211> 30 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 12 

cttgaatcac accaatgcaa tcaacacctc 30 

<210> 13 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 13 

cattttataa taacgctgcg gacatctac 29 

<210> 14 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 



<400> 14 
tttctccata ttgaccatca tactcattg 
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